Commit graph

995 commits

Author SHA1 Message Date
=?UTF-8?q?Sergey=20M=E2=80=A4?= 3302ba8ad7 [test_execution] Add test for lazy extractors (refs #28780) 2021-10-23 01:06:54 +02:00
Lauren Liberda f38b6982dd fix m3u8 parsing test 2021-10-23 01:06:53 +02:00
Lauren Liberda 99ae610f74 fix links to ytdl issues 2021-10-23 01:06:53 +02:00
Laura Liberda 50d15ce421 remove now-invalid unicode_literals test 2021-10-23 01:06:53 +02:00
Laura Liberda 0b4878715c fix hdl tests 2021-10-23 01:06:53 +02:00
=?UTF-8?q?Sergey=20M=E2=80=A4?= a17fa78858 Introduce --output-na-placeholder (closes #27896) 2021-10-23 01:06:52 +02:00
=?UTF-8?q?Sergey=20M=E2=80=A4?= 6ec432f9e4 [YoutubeDL] Raise syntax error for format selection expressions with …
…multiple + operators (closes #27803)
2021-10-23 01:06:52 +02:00
nixxo a447f25d5c [rai] improve subtitles extraction (#27705)
closes #27698
2021-10-23 01:06:52 +02:00
Remita Amine 95a9d868f6 [utils] add a function to clean podcast URLs 2021-10-23 01:06:52 +02:00
Remita Amine d918ab8191 [utils] accept only supported protocols in url_or_none 2021-10-23 01:06:52 +02:00
=?UTF-8?q?Sergey=20M=E2=80=A4?= 93dd9a4b58 [test_InfoExtractor] PEP 8 2021-10-23 01:06:51 +02:00
=?UTF-8?q?Sergey=20M=E2=80=A4?= 1060552241 [extractor/common] Improve JSON-LD interaction statistic extraction (…
…refs #23306)
2021-10-23 01:06:51 +02:00
=?UTF-8?q?Sergey=20M=E2=80=A4?= 0b84ab95e5 [YoutubeDL] Fix --ignore-errors for playlists with generator-based en…
…tries of url_transparent (closes #27064)
2021-10-23 01:06:50 +02:00
=?UTF-8?q?Sergey=20M=E2=80=A4?= 64e706884a [utils] Skip ! prefixed code in js_to_json 2021-10-23 01:06:50 +02:00
Laura Liberda 51eefea363 test_download: adjust tests to the environments properly 2021-02-24 17:46:00 +01:00
Laura Liberda 7fd9596a0f fix tests on envs with lazy_extractors builded 2021-01-19 18:44:27 +01:00
Laura Liberda 9224bfe84f [playwright] fix tests 2021-01-18 22:57:17 +01:00
Laura Liberda c53f744097 playwright wrapper (#28) 2021-01-18 22:39:18 +01:00
Laura Liberda b2f578c3c6 fix youtube url tests 2021-01-10 03:56:47 +01:00
Laura Liberda 889005bab3 selfhosted extractors, peertube extractor reworked (#10) 2020-12-09 21:52:30 +01:00
Laura Liberda 2c9b034593 [test] fix youtube format selection test
it's up to the current standards now
2020-11-17 16:56:26 +01:00
Dominika a661491bcf removed more SWF references from youtube extractor 2020-11-17 13:56:32 +01:00
selfisekai ede99f9f13 remove youtube:live, fix tests 2020-11-15 16:41:15 +01:00
selfisekai 2ed40bbe22 [test] don't check for duplicate url matches for GenericIE tests if test forces generic extractor 2020-11-08 23:19:16 +01:00
Dominika 1c397d1dd3 version 2020.11.01 2020-11-01 05:17:15 +01:00
Dominika 389d4402eb disable test_update 2020-11-01 04:55:45 +01:00
Dominika da166266b8 temporary ci fix? 2020-11-01 04:52:43 +01:00
Dominika ba0734b984 test fixes 2020-11-01 04:46:14 +01:00
Dominika eacab4c6a3 bulk project name change, expect bugssssss 2020-11-01 04:39:17 +01:00
Kevin O'Connor 4eda10499e
[utils] Don't attempt to coerce JS strings to numbers in js_to_json (#26851)
The current logic in `js_to_json` tries to rewrite octal/hex numbers to
decimal. However, when the logic actually happens the `"` or `'` have
already been trimmed off. This causes what were originally strings, that
happen to look like octal/hex numbers, to get rewritten to decimal and
returned as a number rather than a string.

In practive something like:

```js
{
  "0x40": "foo",
  "040": "bar",
}
```

would get rewritten as:

```json
{
  64: "foo",
  32: "bar
}
```

This is problematic since this isn't valid JSON as you cannot have
non-string keys.
2020-10-18 00:10:41 +07:00
Sergey M․ 1d9bf655e6
[utils] Recognize wav mimetype (closes #26463) 2020-09-06 11:19:53 +07:00
Sergey M․ 84213ea8d4
[youtube] Extract chapters from JSON (closes #24819) 2020-06-06 04:22:10 +07:00
Sergey M․ c380cc28c4
[utils] Improve cookie files support
+ Add support for UTF-8 in cookie files
* Skip malformed cookie file entries instead of crashing (invalid entry len, invalid expires at)
2020-05-05 04:21:25 +07:00
Sergey M․ e40c758c2a
[youtube] Improve player id extraction and add tests 2020-05-02 07:18:08 +07:00
Sergey M․ 042b664933
Revert "[utils] Add support for cookies with spaces used instead of tabs"
According to [1] TABs must be used as separators between fields.
Files produces by some tools with spaces as separators are considered
malformed.

1. https://curl.haxx.se/docs/http-cookies.html

This reverts commit cff99c91d1.
2020-03-10 04:53:51 +07:00
Sergey M․ cff99c91d1
[utils] Add support for cookies with spaces used instead of tabs 2020-03-08 18:01:32 +07:00
Sergey M․ ea17979d83
[test_subtitles] Remove obsolete test 2020-02-29 22:08:43 +07:00
Sergey M․ 4e9e1e240d
[test_YoutubeDL] Add tests for #10591 (closes #23873) 2020-02-15 03:37:31 +07:00
Sergey M․ e0abaab293
[test_YoutubeDL] Fix get_ids 2020-02-15 03:37:25 +07:00
Sergey M․ 42db58ec73
[utils] Improve str_to_int 2019-12-15 23:15:24 +07:00
Remita Amine 348c6bf1c1 [utils] handle int values passed to str_to_int 2019-11-29 17:39:18 +01:00
Sergey M․ 1ced222120
[utils] Add generic caesar cipher and rot47 2019-11-27 02:26:42 +07:00
InfernalUnderling 9d30c2132a [utils] Handle rd-suffixed day parts in unified_strdate (#23199) 2019-11-27 00:08:37 +07:00
Remita Amine 237513e801 [yahoo] restore support for cbs suffixed URLs 2019-10-31 07:38:53 +01:00
Sergey M․ 824fa51165
[utils] Improve subtitles_filename (closes #22753) 2019-10-18 04:03:53 +07:00
Sergey M․ 28cc2241e4
[utils] Restrict parse_codecs and add theora as known vcodec (#21381) 2019-06-14 01:56:17 +07:00
Sergey M․ 53cd37bac5
[utils] Improve strip_or_none 2019-05-24 00:03:01 +07:00
Sergey M․ 3089bc748c
Fix W504 and disable W503 (closes #20863) 2019-05-11 03:57:40 +07:00
Jakub Wilk fd35d8cdfd [utils] Transliterate "þ" as "th" (#20897)
Despite visual similarity "þ" is unrelated to "p".
It is normally transliterated as "th":

    $ echo þ-Þ | iconv -t ASCII//TRANSLIT
    th-TH
2019-05-11 01:42:31 +07:00
Sergey M․ 5e1271c56d
[utils] Improve int_or_none and float_or_none (#20403) 2019-03-23 01:08:54 +07:00