995 commits

Author SHA1 Message Date
Sergey M․ 63c541a3cd [test_execution] Add test for lazy extractors (refs #28780) 2021-05-03 22:57:13 +02:00
Lauren Liberda a4a4af8546 fix m3u8 parsing test 2021-03-16 22:23:26 +01:00
Lauren Liberda ec0abef671 fix links to ytdl issues 2021-03-04 14:22:51 +01:00
Laura Liberda 6217437dc2 remove now-invalid unicode_literals test 2021-02-27 00:06:15 +01:00
Laura Liberda 293eada0f4 fix hdl tests 2021-02-26 18:39:45 +01:00
Sergey M․ 1752b8b8c8 Introduce --output-na-placeholder (closes #27896) 2021-02-26 16:17:36 +01:00
Sergey M․ 5b7e334c86 [YoutubeDL] Raise syntax error for format selection expressions with multiple + operators (closes #27803) 2021-02-26 16:12:06 +01:00
nixxo 3311732cc4 [rai] improve subtitles extraction (#27705)
closes #27698
2021-02-26 16:10:40 +01:00
Remita Amine 017215032a [utils] add a function to clean podcast URLs 2021-02-26 16:06:12 +01:00
Remita Amine 2c4b3dd864 [utils] accept only supported protocols in url_or_none 2021-02-26 15:53:27 +01:00
Sergey M․ 541e22037b [test_InfoExtractor] PEP 8 2021-02-26 15:15:40 +01:00
Sergey M․ e4b993e9db [extractor/common] Improve JSON-LD interaction statistic extraction (refs #23306) 2021-02-26 15:14:10 +01:00
Sergey M․ 8175a5e8b1 [YoutubeDL] Fix --ignore-errors for playlists with generator-based entries of url_transparent (closes #27064) 2021-02-26 14:19:38 +01:00
Sergey M․ bc38ef9445 [utils] Skip ! prefixed code in js_to_json 2021-02-26 14:10:46 +01:00
Laura Liberda 51eefea363 test_download: adjust tests to the environments properly 2021-02-24 17:46:00 +01:00
Laura Liberda 7fd9596a0f fix tests on envs with lazy_extractors built 2021-01-19 18:44:27 +01:00
Laura Liberda 9224bfe84f [playwright] fix tests 2021-01-18 22:57:17 +01:00
Laura Liberda c53f744097 playwright wrapper (#28) 2021-01-18 22:39:18 +01:00
Laura Liberda b2f578c3c6 fix youtube url tests 2021-01-10 03:56:47 +01:00
Laura Liberda 889005bab3 selfhosted extractors, peertube extractor reworked (#10) 2020-12-09 21:52:30 +01:00
Laura Liberda 2c9b034593 [test] fix youtube format selection test
it's up to the current standards now
2020-11-17 16:56:26 +01:00
Dominika a661491bcf removed more SWF references from youtube extractor 2020-11-17 13:56:32 +01:00
selfisekai ede99f9f13 remove youtube:live, fix tests 2020-11-15 16:41:15 +01:00
selfisekai 2ed40bbe22 [test] don't check for duplicate url matches for GenericIE tests if test forces generic extractor 2020-11-08 23:19:16 +01:00
Dominika 1c397d1dd3 version 2020.11.01 2020-11-01 05:17:15 +01:00
Dominika 389d4402eb disable test_update 2020-11-01 04:55:45 +01:00
Dominika da166266b8 temporary ci fix? 2020-11-01 04:52:43 +01:00
Dominika ba0734b984 test fixes 2020-11-01 04:46:14 +01:00
Dominika eacab4c6a3 bulk project name change, expect bugssssss 2020-11-01 04:39:17 +01:00
Kevin O'Connor 4eda10499e [utils] Don't attempt to coerce JS strings to numbers in js_to_json (#26851)
The current logic in `js_to_json` tries to rewrite octal/hex numbers to
decimal. However, when the logic actually happens the `"` or `'` have
already been trimmed off. This causes what were originally strings, that
happen to look like octal/hex numbers, to get rewritten to decimal and
returned as a number rather than a string.

In practice something like:

```js
{
  "0x40": "foo",
  "040": "bar",
}
```

would get rewritten as:

```json
{
  64: "foo",
  32: "bar"
}
```

This is problematic since this isn't valid JSON as you cannot have
non-string keys.
2020-10-18 00:10:41 +07:00
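The behaviour described in this commit message can be illustrated with a minimal Python sketch. This is not the project's `js_to_json`; `js_to_json_sketch`, `TOKEN_RE` and `fix_token` are hypothetical names, and the tokenizer is deliberately simplified. The point it shows is that quoted tokens are returned before any numeric rewriting, so only bare hex or octal literals are converted to decimal.

```python
import json
import re

# Hypothetical, simplified stand-in for js_to_json (not the project's code).
# Quoted strings are listed first so they are consumed whole and never
# reach the numeric branches.
TOKEN_RE = re.compile(r'''
    "(?:[^"\\]|\\.)*"        # double-quoted string, escapes allowed
  | '[^'\\]*'                # single-quoted string without escapes
  | \b0[xX][0-9a-fA-F]+\b    # bare hex integer literal
  | \b0[0-7]+\b              # bare legacy octal integer literal
  | ,(?=\s*[\]}])            # trailing comma before a closing bracket
''', re.VERBOSE)


def fix_token(match):
    tok = match.group(0)
    if tok == ',':
        return ''  # JSON does not allow trailing commas
    if tok[0] == '"':
        return tok  # already a JSON string: keep as-is, even if it looks numeric
    if tok[0] == "'":
        # Re-quote with double quotes; still a string, never a number
        return '"%s"' % tok[1:-1].replace('"', '\\"')
    if tok[:2].lower() == '0x':
        return str(int(tok, 16))
    return str(int(tok, 8))


def js_to_json_sketch(code):
    return TOKEN_RE.sub(fix_token, code)


print(js_to_json_sketch('{"0x40": "foo", "040": "bar",}'))
print(js_to_json_sketch('{"a": 0x40, "b": 040}'))
```

With this ordering the string keys `"0x40"` and `"040"` survive unchanged, while the bare literals `0x40` and `040` become `64` and `32`.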
Sergey M․ 1d9bf655e6 [utils] Recognize wav mimetype (closes #26463) 2020-09-06 11:19:53 +07:00
Sergey M․ 84213ea8d4 [youtube] Extract chapters from JSON (closes #24819) 2020-06-06 04:22:10 +07:00
Sergey M․ c380cc28c4 [utils] Improve cookie files support
+ Add support for UTF-8 in cookie files
* Skip malformed cookie file entries instead of crashing (invalid entry len, invalid expires at)
2020-05-05 04:21:25 +07:00
Sergey M․ e40c758c2a [youtube] Improve player id extraction and add tests 2020-05-02 07:18:08 +07:00
Sergey M․ 042b664933 Revert "[utils] Add support for cookies with spaces used instead of tabs"
According to [1] TABs must be used as separators between fields.
Files produced by some tools with spaces as separators are considered
malformed.

1. https://curl.haxx.se/docs/http-cookies.html

This reverts commit cff99c91d1.
2020-03-10 04:53:51 +07:00
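The TAB-only rule from this revert can be sketched as follows. This is a minimal illustration, not the project's cookie jar code: `parse_cookie_line` is a hypothetical helper, and it assumes the usual seven-field Netscape cookie file layout (domain, subdomain flag, path, secure flag, expiry, name, value). Entries that do not split into exactly seven TAB-separated fields are treated as malformed and skipped instead of raising.

```python
def parse_cookie_line(line):
    """Return the 7 fields of a cookie file entry, or None if it is not one.

    Hypothetical helper for illustration only.
    """
    line = line.rstrip('\r\n')
    # Simplification: every blank line or line starting with '#' is skipped
    if not line.strip() or line.startswith('#'):
        return None
    # Fields must be separated by TABs; spaces are not accepted as separators
    fields = line.split('\t')
    if len(fields) != 7:
        return None  # malformed entry: skip rather than crash
    return fields


good = '.example.com\tTRUE\t/\tFALSE\t0\tname\tvalue\n'
bad = '.example.com TRUE / FALSE 0 name value\n'
print(parse_cookie_line(good))
print(parse_cookie_line(bad))
```

The space-separated line yields `None` because `split('\t')` leaves it as a single field.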
Sergey M․ cff99c91d1 [utils] Add support for cookies with spaces used instead of tabs 2020-03-08 18:01:32 +07:00
Sergey M․ ea17979d83 [test_subtitles] Remove obsolete test 2020-02-29 22:08:43 +07:00
Sergey M․ 4e9e1e240d [test_YoutubeDL] Add tests for #10591 (closes #23873) 2020-02-15 03:37:31 +07:00
Sergey M․ e0abaab293 [test_YoutubeDL] Fix get_ids 2020-02-15 03:37:25 +07:00
Sergey M․ 42db58ec73 [utils] Improve str_to_int 2019-12-15 23:15:24 +07:00
Remita Amine 348c6bf1c1 [utils] handle int values passed to str_to_int 2019-11-29 17:39:18 +01:00
Sergey M․ 1ced222120 [utils] Add generic caesar cipher and rot47 2019-11-27 02:26:42 +07:00
InfernalUnderling 9d30c2132a [utils] Handle rd-suffixed day parts in unified_strdate (#23199) 2019-11-27 00:08:37 +07:00
Remita Amine 237513e801 [yahoo] restore support for cbs suffixed URLs 2019-10-31 07:38:53 +01:00
Sergey M․ 824fa51165 [utils] Improve subtitles_filename (closes #22753) 2019-10-18 04:03:53 +07:00
Sergey M․ 28cc2241e4 [utils] Restrict parse_codecs and add theora as known vcodec (#21381) 2019-06-14 01:56:17 +07:00
Sergey M․ 53cd37bac5 [utils] Improve strip_or_none 2019-05-24 00:03:01 +07:00
Sergey M․ 3089bc748c Fix W504 and disable W503 (closes #20863) 2019-05-11 03:57:40 +07:00
Jakub Wilk fd35d8cdfd [utils] Transliterate "þ" as "th" (#20897)
Despite visual similarity "þ" is unrelated to "p".
It is normally transliterated as "th":

    $ echo þ-Þ | iconv -t ASCII//TRANSLIT
    th-TH
2019-05-11 01:42:31 +07:00
Sergey M․ 5e1271c56d [utils] Improve int_or_none and float_or_none (#20403) 2019-03-23 01:08:54 +07:00