Commit graph

12777 commits

Author SHA1 Message Date
Sergey M․ b5c39537be
[noovo] Improve extraction (closes #12792) 2017-04-30 00:24:25 +07:00
Frederic Bournival 1c7c76e4fb
[noovo] Add extractor 2017-04-30 00:24:19 +07:00
John Hawkinson 557194591a
[washingtonpost] Add support for embeds (closes #12699) 2017-04-29 23:07:26 +07:00
Yen Chi Hsuan 27e70a8f6c Merge pull request #12869 from Tithen-Firion/cbc-update-tests
[cbc] update test cases
2017-04-29 21:34:18 +08:00
Sergey M․ a4c81e4968
[yandexmusic:playlist] Fix extraction for python 3 (closes #12888) 2017-04-29 20:23:26 +07:00
Sergey M․ 7986c3abcd
[anvato] Improve extraction (closes #12913)
* Promote to regular shortcut based extractor
* Add mcp to access key mapping table
* Add support for embeds extraction
* Add support for anvato embeds in generic extractor
2017-04-29 19:49:04 +07:00
Yen Chi Hsuan a1ebfd4494 Merge pull request #12854 from Tithen-Firion/appletrailer-test-fix
[appletrailers] update test cases
2017-04-29 19:24:38 +08:00
Yen Chi Hsuan d19093bd50 Merge pull request #12906 from Tithen-Firion/clean-html-fix
[utils] Fix inconsistent output of clean_html
2017-04-29 15:58:45 +08:00
Yen Chi Hsuan 24eb7c2578
[xtube] Fix extraction with non-standard JSON 'sources'
Closes #12734

Thanks @paulguy for the fix!
2017-04-29 15:55:08 +08:00
Sergey M․ e7db6759e4
[downloader/external] Properly handle live stream downloading cancellation (closes #8932) 2017-04-29 04:33:35 +07:00
Sergey M․ b364c87c42
[tvplayer] Fix extraction (closes #12908) 2017-04-29 03:46:08 +07:00
Tithen-Firion edd9221cd2 [utils] Fix inconsistent output of clean_html
`\s` in Python 2.x doesn't match unicode whitespace characters by
default
2017-04-28 17:34:27 +02:00
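The whitespace behaviour mentioned in this commit message can be illustrated on Python 3, where the default is reversed: `\s` matches Unicode whitespace for `str` patterns unless `re.ASCII` is given, which roughly approximates the Python 2.x default. This is a minimal sketch and not the actual `clean_html` change; the helper names `collapse_ascii` and `collapse_unicode` are hypothetical.

```python
import re

NBSP = '\u00a0'  # no-break space: whitespace in Unicode, but not in ASCII


def collapse_ascii(text):
    # Hypothetical helper: \s is limited to ASCII whitespace here,
    # similar to Python 2.x patterns compiled without re.UNICODE.
    return re.sub(r'\s+', ' ', text, flags=re.ASCII)


def collapse_unicode(text):
    # Hypothetical helper: \s also matches Unicode whitespace such as NBSP.
    return re.sub(r'\s+', ' ', text, flags=re.UNICODE)


sample = 'a' + NBSP + ' b'
print(repr(collapse_ascii(sample)))
print(repr(collapse_unicode(sample)))
```

With the ASCII-only pattern the no-break space survives, so the same HTML could be cleaned differently depending on the interpreter; passing the Unicode flag explicitly removes that difference.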
Sergey M․ bc8a2ea071
release 2017.04.28 2017-04-28 18:30:03 +07:00
Remita Amine 20783b8b50 [aenetworks] fix extraction for shows with single season 2017-04-28 12:04:56 +01:00
Remita Amine bf2a5555c0 [go] add support for Disney, DisneyJunior and DisneyXD show pages 2017-04-28 09:48:52 +01:00
Remita Amine fb8e8b2d16 [adobepass] use geo verification headers for all requests 2017-04-28 09:48:52 +01:00
Yen Chi Hsuan b62985a9a5
[youtube] Recognize another HTML5 player URL (#12885) 2017-04-28 16:25:04 +08:00
Yen Chi Hsuan e31fed95b4
[youtube] Recognize new locale-based player URLs (fixes #12885) 2017-04-28 15:48:30 +08:00
Tithen-Firion 3fd0f70f6a [cbslocal] Update test 2017-04-28 04:26:59 +07:00
Tithen-Firion 33c62efc32 [collegerama] Update tests 2017-04-28 04:00:49 +07:00
Tithen-Firion 6b4ddd336c [afreecatv] Fix title extraction 2017-04-28 04:00:15 +07:00
Tithen-Firion c12b4b80f8 [archiveorg] Update test 2017-04-28 03:48:32 +07:00
Tithen-Firion 064fafe932 [appleconnect] Update test 2017-04-28 03:47:25 +07:00
Tithen-Firion ac1a5b9a12 [audioboom] Update test 2017-04-28 03:36:28 +07:00
Tithen-Firion a15777491a [atresplayer] Update test 2017-04-28 03:32:25 +07:00
Tithen-Firion d8571dd6bf [bleacherreport] Update tests 2017-04-28 03:28:26 +07:00
Sergey M․ c0fa4245ce
[downloader/fragment] Remove assert for resume_len when no fragments downloaded
This may be incorrect due to some header (e.g. flv header in f4m downloader)
2017-04-28 03:26:19 +07:00
Tithen-Firion 8814ae42bc [beeg] Update test 2017-04-28 03:14:11 +07:00
Tithen-Firion 0f63dc2402 [bandcamp] Update test 2017-04-28 03:13:12 +07:00
Tithen-Firion dde97ea8da [canalc2] Update test 2017-04-28 03:07:42 +07:00
Sergey M․ c89b49f743
[extractor/common] Add manifest_url for explicit group rendition formats 2017-04-28 03:00:14 +07:00
Tithen-Firion 6f4a888416 [br] Update test 2017-04-28 02:53:11 +07:00
Tithen-Firion f5edd7ae51 [clipfish] Update test 2017-04-28 02:51:30 +07:00
Tithen-Firion c95e2b5911 [cbc] update test cases 2017-04-27 18:07:07 +02:00
Sergey M․ ff99fe529e
Don't list master m3u8 playlists in format list (closes #12832) 2017-04-27 21:53:17 +07:00
Tithen-Firion 76c1951036 [appletrailers] update test cases 2017-04-27 10:04:21 +02:00
Lucas M e8bfe2a946 [streamable] Add support for new embedded URL schema 2017-04-26 23:39:53 +07:00
Sergey M․ 3dc8b61b7f
[arte:+7] Relax _VALID_URL (closes #12837) 2017-04-26 01:55:29 +07:00
Sergey M․ a82f41841d
release 2017.04.26 2017-04-26 00:06:12 +07:00
Sergey M․ ac9c69ace7
[extractor/common] Improve jwplayer regex 2017-04-25 23:46:05 +07:00
Sergey M․ 85f6de25e4
[downloader/fragment] Clarify current_fragment's index and mark as experimental 2017-04-25 23:33:35 +07:00
Yen Chi Hsuan 01cb57016f
[iqiyi] Fix extraction of Yule videos 2017-04-25 22:23:57 +08:00
Sergey M․ 290f64dbaa
[downloader/fragment] Improve .ytdl format and start documenting 2017-04-24 23:50:20 +07:00
Sergey M․ adb4b03cd5
[downloader/fragment] Don't process ytdl file when it's not needed yet 2017-04-24 23:05:56 +07:00
Sergey M․ 0eee52f34b
Introduce --keep-fragments 2017-04-24 03:09:08 +07:00
Sergey M․ d3f0687cf7
[downloader/fragment] Use temp file for current fragment 2017-04-24 02:54:17 +07:00
Sergey M․ a4d6cf970c
[YoutubeDL] Fix output template for missing timestamp (closes #12796) 2017-04-24 00:50:39 +07:00
Sergey M․ 3019cb0c99
[extractor/common] Rephrase comment 2017-04-23 11:52:07 +07:00
Sergey M․ ddd258f922
[test_InfoExtractor] Add m3u8 parsing test for NAME attribute in EXT-X-STREAM-INF tag 2017-04-23 11:49:57 +07:00
Sergey M․ 07ad0cf34f
[vidio] Improve and sort formats 2017-04-23 11:48:51 +07:00
Sergey M․ 9c99bef704
[extractor/common] Use float for scaled tbr 2017-04-23 11:33:49 +07:00
Remita Amine ffbc8386b9 [brightcove] match only video elements with data-video-id attribute 2017-04-22 22:26:20 +01:00
Remita Amine 4abdba643c [downloader/fragment] remove unused code 2017-04-22 18:19:47 +01:00
Remita Amine 3e0304fe6e [downloader/fragment] use the documented names for fragment progress_hooks fields 2017-04-22 16:42:24 +01:00
Yen Chi Hsuan fbf56be213
[iqiyi] Fix playlist detection (#12504) 2017-04-22 22:11:37 +08:00
Yen Chi Hsuan 54f54fcca7
[socks] Report errors elegantly when credentials are required but missing
In some non-standard implementations, the server may respond AUTH_USER_PASS
even if it's not listed in available authentication methods. (it should
respond AUTH_NO_ACCEPTABLE per standards)
2017-04-22 21:48:41 +08:00
Yen Chi Hsuan facfd79f9a
[azubu] Remove extractor as the site is gone (closes #12813) 2017-04-22 21:20:25 +08:00
Yen Chi Hsuan 3110bb937d
[porn91] Fix extraction (closes #12814) 2017-04-22 21:16:36 +08:00
Sergey M․ cb2520802d
[extractor/common] Improve m3u8 extraction (closes #12211)
* Extract m3u8 parsing to separate method
* Improve rendition groups extraction
* Build stream name according to stream GROUP-ID
* Ignore reference to AUDIO group without URI when stream has no CODECS
+ Add test coverage for parsing m3u8 from #11507, #11995, #12211 and twitch vod
2017-04-22 07:01:00 +07:00
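The rendition-group handling listed in this commit can be pictured with a small sketch that parses HLS attribute lists and groups `EXT-X-MEDIA` entries by `GROUP-ID`. It is an illustration under assumptions, not the extractor's code: the function names `parse_attributes` and `group_renditions`, the regular expression, and the sample playlist are all made up for this example.

```python
import re


def parse_attributes(attrib):
    # Hypothetical helper: split an HLS attribute list such as
    # BANDWIDTH=1280000,CODECS="avc1.4d401e,mp4a.40.2" into a dict,
    # keeping commas that appear inside quoted values.
    info = {}
    for key, val in re.findall(
            r'([A-Z0-9-]+)=("[^"]+"|[^",]+)(?:,|$)', attrib):
        if val.startswith('"'):
            val = val[1:-1]
        info[key] = val
    return info


def group_renditions(m3u8_doc):
    # Hypothetical helper: collect EXT-X-MEDIA renditions per GROUP-ID.
    prefix = '#EXT-X-MEDIA:'
    groups = {}
    for line in m3u8_doc.splitlines():
        if line.startswith(prefix):
            media = parse_attributes(line[len(prefix):])
            group_id = media.get('GROUP-ID')
            if group_id:
                groups.setdefault(group_id, []).append(media)
    return groups


SAMPLE = '\n'.join([
    '#EXTM3U',
    '#EXT-X-MEDIA:TYPE=AUDIO,GROUP-ID="aac",NAME="English",URI="audio/en.m3u8"',
    '#EXT-X-MEDIA:TYPE=AUDIO,GROUP-ID="aac",NAME="French"',
    '#EXT-X-STREAM-INF:BANDWIDTH=1280000,CODECS="avc1.4d401e,mp4a.40.2",AUDIO="aac"',
    'video/low.m3u8',
])

for group_id, renditions in group_renditions(SAMPLE).items():
    print(group_id, [r.get('NAME') for r in renditions])
```

A stream's `AUDIO` attribute can then be looked up in the grouped dict, and a rendition without a `URI` (the second entry above) can be recognised and handled separately.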
Sergey M․ f779958250
[vidzi] Fix extraction (closes #12793) 2017-04-21 23:37:06 +07:00
Remita Amine 8abc7dca39 [amp] extract error message (closes #12795) 2017-04-20 05:16:41 +01:00
Remita Amine ea0c2f219c [downloader/fragment] use a general file to store fragment download context 2017-04-19 18:53:15 +01:00
Sergey M․ 481ef51e23
[brightcove] PEP 8 2017-04-19 21:47:03 +07:00
Remita Amine 5b995f713b [utils] add support for ttml styles 2017-04-19 14:38:40 +01:00
Remita Amine 75a2485407 [fragment,hls,f4m,dash,ism] improve fragment downloading
- resume immediately
- no need to concatenate segments and decrypt them on every resume
- no need to save temp files for segments

and for hls downloader:
- no need to download keys for segments that are already downloaded
2017-04-19 11:46:07 +01:00
Remita Amine 58f6ab72ed [odnoklassniki] update tests 2017-04-19 00:16:55 +01:00
Sergey M․ 2dc48df5bc
[xfileshare] Add support for gorillavid.com and daclips.com (closes #12776) 2017-04-18 23:58:37 +07:00
Sergey M․ 18848d226a
[instagram] Fix extraction (closes #12777) 2017-04-18 22:40:26 +07:00
Sergey M․ a32a9a7ef5
[extractor/common] Add support for multiple getters in try_get 2017-04-18 22:39:58 +07:00
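The idea of trying several getters in turn can be sketched as follows. This is a minimal illustration written for this listing, assuming a helper with the signature `try_get(src, getter, expected_type=None)`; the exact set of exceptions swallowed and the sample data are assumptions, not a copy of the project's code.

```python
def try_get(src, getter, expected_type=None):
    # Accept either a single callable or a list/tuple of callables.
    if not isinstance(getter, (list, tuple)):
        getter = [getter]
    for get in getter:
        try:
            value = get(src)
        except (AttributeError, KeyError, TypeError, IndexError):
            # This getter does not apply to the data; try the next one.
            continue
        if expected_type is None or isinstance(value, expected_type):
            return value
    # No getter produced a usable value.
    return None


# Hypothetical metadata where the field may live in one of two places.
data = {'video': {'title': 'Example'}}
title = try_get(
    data,
    [lambda x: x['media']['title'], lambda x: x['video']['title']],
    str)
print(title)
```

Passing a list keeps call sites short when a site has served the same field under different keys over time.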
Sergey M․ bae1404893
[extractor/common] Add support for video of WebPage context in _json_ld (closes #12778) 2017-04-18 22:21:38 +07:00
Yen Chi Hsuan 06d0ad9a4e
[brightcove] Support URLs with bcpid instead of playerID
Fixes #12482
2017-04-18 23:04:22 +08:00
Sergey M․ f631b55791
[brightcove] Fix _extract_url (closes #12782) 2017-04-18 21:46:25 +07:00
Remita Amine bf1b87cd91 [common] Relax JWPlayer regex and remove duplicate urls (#12768) 2017-04-17 08:48:24 +01:00
Remita Amine 1c35b3da44 [odnoklassniki] extract m3u8 formats 2017-04-16 21:27:08 +01:00
Sergey M․ 16a09aefe3
release 2017.04.17 2017-04-17 00:46:32 +07:00
Sergey M․ ab87c26009
[extractor/generic] Add test for #12761 2017-04-17 00:36:59 +07:00
Sergey M․ 91bc57e4c5
[limelight] Add support for channels and channels lists embedded using LimelightEmbeddedPlayerFlash (#12761) 2017-04-17 00:33:47 +07:00
Sergey M․ e5d39886ec
[limelight] Improve embeds extraction (closes #12761)
* Move extraction code to extractor
* Add extraction for LimelightEmbeddedPlayerFlash embeds
* Extract multiple videos
2017-04-17 00:23:16 +07:00
Sergey M․ 751c89a27d
[itv] Extract series metadata 2017-04-16 23:19:20 +07:00
Sergey M․ c2d7d76efd
[itv] Fix rtmp formats (#12759) 2017-04-16 23:15:24 +07:00
Sergey M․ f67177cae8
[itv] Use native hls 2017-04-16 21:52:45 +07:00
Sergey M․ 5935ef3c5d
[itv] Lower preference for rtmp formats (closes #12759) 2017-04-16 21:52:07 +07:00
remitamine 72950c4dce Merge pull request #12752 from triple-j/go90_improvements_pull_request
[go90] Improve extraction
2017-04-15 23:48:13 +02:00
Jeremie J. Jarosh 334f41e0d8 [go90] Improve extraction
- add metadata for 'series', 'episode', 'season', 'season_id',
  'season_number', and 'episode_number'
- integrate series title into the title
- extract subtitles (fallback to `vtt` if the subtitle file type
  detection fails as that is the most likely extension)
2017-04-15 16:32:00 -05:00
Sergey M․ 022250a594
release 2017.04.16 2017-04-16 01:49:02 +07:00
Sergey M․ d35dc344af
[YoutubeDL] Apply expand_path after output template substitution
os.path.expandvars translates '%%' into '%' making output template invalid.
Before: '%%(ext)s' -(expand path)-> '%(ext)s' -(outtmpl subst.)-> 'mp4'
After: '%%(ext)s' -(outtmpl subst.)-> '%(ext)s' -(expand path)-> '%(ext)s'
2017-04-16 01:39:07 +07:00
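The ordering problem described in this commit message can be reproduced with the standard library alone. The sketch below calls `ntpath.expandvars` directly, since `%`-style expansion is the Windows flavour of `os.path.expandvars` and the POSIX version only handles `$name` and `${name}`. The variable names and the `info` dict are illustrative assumptions, not the project's code.

```python
import ntpath

# An output template in which the user escaped the percent sign,
# asking for the literal text '%(ext)s' in the final file name.
template = '%%(ext)s'
info = {'ext': 'mp4'}  # hypothetical metadata used for substitution

# Old order: expand the path first, then substitute the template.
# expandvars collapses '%%' into '%', so the escape is lost and the
# field is substituted after all.
before = ntpath.expandvars(template) % info

# New order: substitute the template first, then expand the path.
# The escape is consumed by the % operator and the literal survives.
after = ntpath.expandvars(template % info)

print(repr(before))
print(repr(after))
```

Running the substitution first means path expansion only ever sees the finished file name, so it can no longer change the meaning of the template.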
Sergey M․ 0563f7ac6e
[YoutubeDL] Propagate overridden metadata to IE results of type url (closes #11163) 2017-04-16 00:57:36 +07:00
Sergey M․ 413c1f8e2f
[extractor/generic] Extract RSS entries as url_transparent (#11163) 2017-04-16 00:57:32 +07:00
Sergey M․ 8068296276
[streamango] Improve extraction (closes #12643) 2017-04-15 21:50:51 +07:00
Marvin Ewald 4db79fa1bc
[streamango] Add extractor 2017-04-15 21:50:46 +07:00
Sergey M․ b2a19e3829
[wsj] Improve and modernize (closes #12558) 2017-04-15 20:59:05 +07:00
John Hawkinson 3266d08af2
[wsj:article] Add extractor 2017-04-15 20:58:57 +07:00
Sergey M․ 0254f93b08
[brightcove] Validate ambiguous embeds' URLs (closes #9163, closes #12005, closes #12178, closes #12480) 2017-04-15 20:31:49 +07:00
John Hawkinson 40158f55c9
[brightcove] Relax video tag embeds extraction
BrightcoveNewIE.extract_urls(): Handle player data when it is all in
attributes of the <video> tag, and also the id is
data-brightcove-video-id not data-video-id.

Add test to generic extractor, note HEAD gives 404, no checksum computed(?).
2017-04-15 20:31:43 +07:00
Sergey M․ 3dfceb286c
[udemy] Add support for react rendition (closes #12744) 2017-04-15 03:26:19 +07:00
Sergey M․ 57e2cb6c3a
[curiositystream] PEP 8 2017-04-15 02:33:26 +07:00
Sergey M․ 6f1b2374af
[canalplus] PEP 8 2017-04-15 02:33:12 +07:00
Sergey M․ d24bd4ffdb
release 2017.04.15 2017-04-15 01:39:33 +07:00
Sergey M․ 34c3bbe1bd
[youku] Keep old fileid extraction code as fallback (#12741) 2017-04-15 01:28:02 +07:00
xuhaomin e5106ba402 [youku] Fix fileid extraction 2017-04-14 13:23:02 -05:00