faster-whisper

heimoshuiyu/faster-whisper

Fork 0

28a4d11a73 Revert "Merge remote-tracking branch 'upstream/master' into prompt" prompt heimoshuiyu 2024-09-12 00:49:31 +08:00
6e42088656 Merge remote-tracking branch 'upstream/master' into prompt heimoshuiyu 2024-09-04 17:48:06 +08:00
d57c5b40b0 Remove the usage of transformers.pipeline from BatchedInferencePipeline and fix word timestamps for batched inference (#921) Mahmoud Ashraf 2024-07-27 05:02:58 +03:00
83a368e98a Make vad-related parameters configurable for batched inference. (#923) zh-plus 2024-07-24 10:00:32 +08:00
eb8390233c New PR for Faster Whisper: Batching Support, Speed Boosts, and Quality Enhancements (#856) Jilt Sebastian 2024-07-18 11:48:52 +02:00
4a59bb011d Merge remote-tracking branch 'upstream/master' into prompt heimoshuiyu 2024-07-10 10:16:35 +08:00
fbcf58bf98 Fix language detection with non-speech audio (#895) trungkienbkhn 2024-07-05 14:43:45 +07:00
1195359984 Filter out non_speech_tokens in suppressed tokens (#898) Jordi Mas 2024-07-05 09:43:11 +02:00
c22db5125d Bump version to 1.0.3 (#887) trungkienbkhn 2024-07-01 16:36:12 +07:00
8862bee1f8 Improve language detection when using clip_timestamps (#867) ABen 2024-07-01 17:12:45 +08:00
8d400e9870 Upgrade to Silero-Vad V5 (#884) Ki Hoon Kim 2024-07-01 17:40:37 +09:00
bced5f04c0 docs: add 'faster-whisper-server' community integration (#861) Fedir Zadniprovskyi 2024-06-05 08:27:41 -07:00
65551c081f Docker file improvements (#848) Fedir Zadniprovskyi 2024-05-19 19:13:19 -07:00
f53be1e811 Add distil models to WhisperModel init and download_model docstrings (#847) Napuh 2024-05-20 03:51:22 +02:00
4acdb5c619 Fix #839 incorrect clip_timestamps being used in model (#842) Natanael Tan 2024-05-17 17:35:07 +08:00
a1c3583c96 Update README.md (#841) Peter Krantz 2024-05-17 10:24:47 +02:00
2036d12634 Add Dockerfile example (#828) trungkienbkhn 2024-05-13 16:33:09 +07:00
2f6913efc8 Bump version to 1.0.2 (#816) trungkienbkhn 2024-05-06 09:02:54 +07:00
e11d58599d Allow av to include version 12. (#819) ddorian 2024-05-06 03:57:35 +02:00
49a80eb8a8 Clarify documentation for hotwords (#817) Keating Reid 2024-05-05 21:52:59 -04:00
8d5e6d56d9 Support initializing more whisper model args (#807) trungkienbkhn 2024-05-04 15:12:59 +07:00
6eec07739e Add benchmarking logic for memory, wer and speed (#773) trungkienbkhn 2024-05-04 15:12:43 +07:00
847fec4492 Feature/add hotwords (#731) jax 2024-05-04 16:11:52 +08:00
46080e584e Loosening tokenizers version constraint (#804) Keating Reid 2024-05-04 04:10:24 -04:00
3d1de60ef3 CUDA version and updated installation instructions (#785) Sidharth Rajaram 2024-05-04 01:09:59 -07:00
4ee1d54c14 Merge branch 'master' into prompt heimoshuiyu 2024-04-08 20:56:49 +08:00
91c8307aa6 make faster_whisper.assets as a valid python package to distribute (#772) (#774) master otakutyrant 2024-04-03 00:22:22 +08:00
b024972a56 Foolproof: Disable VAD if clip_timestamps is in use (#769) Purfview 2024-04-02 17:20:34 +01:00
8ae82c8372 Bugfix: code breaks if audio is empty (#768) Purfview 2024-04-02 17:18:12 +01:00
e0c3a9ed34 Update project github link to SYSTRAN (#746) trungkienbkhn 2024-03-27 14:31:17 +07:00
a67e0e47ae Add support for distil-large-v3 (#755) Sanchit Gandhi 2024-03-26 13:58:39 +00:00
1eb9a8004c Improve language detection (#732) trungkienbkhn 2024-03-12 21:44:49 +07:00
e50d82c18c Merge remote-tracking branch 'upstream/master' into prompt heimoshuiyu 2024-03-10 11:53:58 +08:00
a342b028b7 Bump version to 1.0.1 (#725) trungkienbkhn 2024-03-01 17:32:12 +07:00
5090cc9d0d Fix window end heuristic for hallucination_silence_threshold (#706) Purfview 2024-02-29 16:59:32 +00:00
09cd57e7f3 Fix typo 'ditil' (#721) Gabriel F 2024-02-29 13:08:58 -03:00
16141e65d9 Add pad_or_trim function to handle segment before encoding (#705) trungkienbkhn 2024-02-29 23:08:28 +07:00
4b64ef1f70 Merge branch 'master' into prompt heimoshuiyu 2024-02-23 10:52:53 +08:00
06d32bf0c1 Bump version to 1.0.0 (#696) trungkienbkhn 2024-02-22 15:49:01 +07:00
30d6043e90 Prevent infinite loop for out-of-bound timestamps in clip_timestamps (#697) Purfview 2024-02-22 08:48:35 +00:00
22c75d0cc3 Update README.md (#672) BBC-Esq 2024-02-21 04:18:11 -05:00
092067208b Add clip_timestamps and hallucination_silence_threshold options (#646) trungkienbkhn 2024-02-20 23:34:54 +07:00
6ffcbdfbc2 Fix typos in README.md (#668) Jordi Mas 2024-02-20 17:33:17 +01:00
52695567c9 Bumps up PyAV version to support Python 3.12.x (#679) Purfview 2024-02-20 16:31:07 +00:00
c6b28ed3a0 Update README.md (#685) IlianP 2024-02-20 17:28:00 +01:00
4ab646035f Upgrade ctranslate2 version to support CUDA 12 (#694) trungkienbkhn 2024-02-20 23:26:55 +07:00
d04e685ca2 Merge branch 'master' into prompt heimoshuiyu 2024-02-19 17:31:58 +08:00
f144e4c83d Expands the note for distil-whisper (#659) Purfview 2024-01-28 20:48:40 +00:00
3aec421849 Add: More clarity of what "max_new_tokens" does (#658) Purfview 2024-01-28 20:40:33 +00:00
64b9f244bd Whisper-Streaming mention (#656) Dominik Macháček 2024-01-25 18:27:27 +01:00
00efce1696 Bugfix: Illogical "Avoid computing higher temperatures on no_speech" (#652) Purfview 2024-01-24 10:54:43 +00:00
ad3c83045b support distil-whisper (#557) metame 2024-01-24 17:17:12 +08:00
72ff979a2e Add GUI faster-whisper project README.md (#554) Jürgen Fleiß 2024-01-18 13:01:02 +01:00
615de0d2d9 add WhisperLive to community integration (#647) makaveli 2024-01-18 17:24:14 +05:30
b835bdaaf1 Merge remote-tracking branch 'upstream/master' into prompt heimoshuiyu 2023-12-25 17:56:50 +08:00
44f7e58947 Update whisper-standalone-win description in README.md (#508) Purfview 2023-12-14 12:03:46 +00:00
ebcfd6b964 Fix broken prompt_reset_on_temperature (#604) Purfview 2023-12-13 12:14:39 +00:00
19329a3611 Word timing tweaks (#616) trungkienbkhn 2023-12-13 18:38:44 +07:00
65094b779e Update info on cuBLAS and cuDNN libs in README.md (#513) Purfview 2023-11-27 11:12:47 +00:00
9641d5f56a Force read-mode in av.open (#566) Clayton Yochum 2023-11-27 02:43:35 -07:00
e1a218fab1 Bump version to 0.10.0 Dang Chuan Nguyen 2023-11-24 23:19:47 +01:00
3084409633 Add V3 Support (#578) Oscaarjs 2023-11-24 23:16:12 +01:00
5a0541ea7d Bump version to 0.9.0 Guillaume Klein 2023-09-18 16:21:37 +02:00
e94711bb5c Add property WhisperModel.supported_languages (#476) Guillaume Klein 2023-09-14 17:42:02 +02:00
0048844f54 Expose function available_models (#475) Guillaume Klein 2023-09-14 17:17:01 +02:00
a49097e655 Add some missing typing annotations in transcribe.py Guillaume Klein 2023-09-12 15:45:54 +02:00
81086f6d33 Always run the encoder at the beginning of the loop (#468) Guillaume Klein 2023-09-12 14:44:37 +02:00
f697945691 Update tokenizers requirement to include version 0.14 (#469) Guillaume Klein 2023-09-12 14:44:22 +02:00
727ab81f31 Improve error message for invalid task and language parameters (#466) Guillaume Klein 2023-09-12 10:02:23 +02:00
0285d46f6f Add more details about the requirements in the README (#463) Guillaume Klein 2023-09-08 14:35:17 +02:00
ad388cd394 Bump version to 0.8.0 Guillaume Klein 2023-09-04 11:56:48 +02:00
4a41746e55 Log a warning when the model is English-only but the language is set to something else (#454) Guillaume Klein 2023-09-04 11:55:40 +02:00
1e6eb967c9 Add "large" alias for "large-v2" model (#453) Guillaume Klein 2023-09-04 11:54:42 +02:00
f0ff12965a Expose generation parameter no_repeat_ngram_size (#449) Guillaume Klein 2023-09-01 17:31:30 +02:00
5871858a5f Force the garbage collector to run after decoding the audio with PyAV (#448) Guillaume Klein 2023-09-01 15:25:13 +02:00
e87fbf8a49 Added audio duration after VAD to TranscriptionInfo object (#445) MinorJinx 2023-08-31 10:19:48 -05:00
7b271da035 docs: add wscribe to community integrations (#427) Hrishikesh Barman 2023-08-17 12:20:24 +05:30
1562b02345 added repetition_penalty to TranscriptionOptions (#403) Aisu Wata 2023-08-06 05:08:24 -03:00
1ce16652ee Adds DEBUG log message for prompt_reset_on_temperature (#399) Purfview 2023-08-04 08:06:17 +01:00
857be6f621 Rename clear_previous_text_on_temperature argument (#398) Purfview 2023-08-03 17:44:37 +01:00
1a1eb1a027 Add clear_previous_text_on_temperature parameter (#397) KH 2023-08-03 22:40:58 +09:00
5c17de1771 Bump version to 0.7.1 Guillaume Klein 2023-07-24 11:10:12 +02:00
0f55c436fe Invalidate the cached encoder output when no_speech threshold is met (#376) Guillaume Klein 2023-07-24 10:57:15 +02:00
e786e26f75 Return result with best log prob when all temperature fallbacks failed (#356) KH 2023-07-20 23:13:11 +09:00
687db319e0 Remove duplicate code (#359) KH 2023-07-18 23:03:01 +09:00
171d90dd1f Bump version to 0.7.0 Guillaume Klein 2023-07-18 15:23:47 +02:00
0e051a5b77 Prepend prefix tokens with the initial timestamp token (#358) Guillaume Klein 2023-07-18 15:22:39 +02:00
2a37390fed Minor reformatting in code snippet Guillaume Klein 2023-07-18 15:08:53 +02:00
3b4a6aa1c2 Improve timestamp heuristics (#336) Hoon 2023-07-05 22:16:53 +09:00
c7cb2aa8d4 Add support for using whisper models from Huggingface by specifying the model id. (#334) zh-plus 2023-07-03 23:40:10 +08:00
c0d93d0829 Avoid computing higher temperatures on no_speech segments (#225) Guillaume Klein 2023-07-03 10:20:36 +02:00
19c294f978 Squash long words at window and sentence boundaries (#226) Guillaume Klein 2023-07-03 10:20:20 +02:00
9f24e2c735 Merge branch 'master' into prompt heimoshuiyu 2023-06-24 18:03:05 +08:00
fee52c9229 Allow users to input an Iterable of token ids into initial_prompt (#306) FlippFuzz 2023-06-21 20:46:20 +08:00
efc4f61d85 Do not specify the vocabulary file extension in the download pattern (#311) Guillaume Klein 2023-06-20 10:53:11 +02:00
ad58ba26ab Fix typo (#304) kh 2023-06-16 14:37:45 +09:00
20d4e9418b Add Open-Lyrics as a community project. (#291) zh-plus 2023-06-10 14:22:29 +08:00
d4222da952 Update README with community repo using FW (#284) Antonio Zarauz Moreno 2023-06-07 11:30:53 +02:00
1bb7e33b93 Reformat code snippet in README Guillaume Klein 2023-05-24 18:22:44 +02:00
2a00621564 Bump version to 0.6.0 Guillaume Klein 2023-05-24 16:15:01 +02:00

Commit Graph Select branches Hide Pull Requests master prompt #1 Mono Color

Commit Graph

Select branches

Hide Pull Requests

master

prompt

#1