trungkienbkhn
8d5e6d56d9
Support initializing more whisper model args ( #807 )
2024-05-04 15:12:59 +07:00
trungkienbkhn
6eec07739e
Add benchmarking logic for memory, wer and speed ( #773 )
2024-05-04 15:12:43 +07:00
jax
847fec4492
Feature/add hotwords ( #731 )
...
* add hotword params
---------
Co-authored-by: jax <jax_builder@gamil.com >
2024-05-04 15:11:52 +07:00
Keating Reid
46080e584e
Loosening tokenizers version constraint ( #804 )
2024-05-04 15:10:24 +07:00
Sidharth Rajaram
3d1de60ef3
CUDA version and updated installation instructions ( #785 )
...
* CUDA version note and updated instructions in README
* ctranslate2 downgrade note, cuDNN v9 consideration
* clearer note on cuDNN v9 package
2024-05-04 15:09:59 +07:00
otakutyrant
91c8307aa6
make faster_whisper.assets as a valid python package to distribute ( #772 ) ( #774 )
2024-04-02 18:22:22 +02:00
Purfview
b024972a56
Foolproof: Disable VAD if clip_timestamps is in use ( #769 )
...
* Foolproof: Disable VAD if clip_timestamps is in use
Prevent silly things to happen.
2024-04-02 18:20:34 +02:00
Purfview
8ae82c8372
Bugfix: code breaks if audio is empty ( #768 )
...
* Bugfix: code breaks if audio is empty
Regression since https://github.com/SYSTRAN/faster-whisper/pull/732 PR
2024-04-02 18:18:12 +02:00
trungkienbkhn
e0c3a9ed34
Update project github link to SYSTRAN ( #746 )
2024-03-27 08:31:17 +01:00
Sanchit Gandhi
a67e0e47ae
Add support for distil-large-v3 ( #755 )
...
* add distil-large-v3
* Update README.md
* use fp16 weights from Systran
2024-03-26 14:58:39 +01:00
trungkienbkhn
1eb9a8004c
Improve language detection ( #732 )
2024-03-12 15:44:49 +01:00
trungkienbkhn
a342b028b7
Bump version to 1.0.1 ( #725 )
2024-03-01 11:32:12 +01:00
Purfview
5090cc9d0d
Fix window end heuristic for hallucination_silence_threshold ( #706 )
...
Removes the wishful heuristic causing more issues than it's fixing.
Same as https://github.com/openai/whisper/pull/2043
Example of the issue: https://github.com/openai/whisper/pull/1838#issuecomment-1960041500
2024-02-29 17:59:32 +01:00
Gabriel F
09cd57e7f3
Fix typo 'ditil' ( #721 )
2024-02-29 17:08:58 +01:00
trungkienbkhn
16141e65d9
Add pad_or_trim function to handle segment before encoding ( #705 )
2024-02-29 17:08:28 +01:00
trungkienbkhn
06d32bf0c1
Bump version to 1.0.0 ( #696 )
2024-02-22 09:49:01 +01:00
Purfview
30d6043e90
Prevent infinite loop for out-of-bound timestamps in clip_timestamps ( #697 )
...
Same as https://github.com/openai/whisper/pull/2005
2024-02-22 09:48:35 +01:00
BBC-Esq
22c75d0cc3
Update README.md ( #672 )
...
Add Faster-Whisper-Transcriber to community integrations.
2024-02-21 10:18:11 +01:00
trungkienbkhn
092067208b
Add clip_timestamps and hallucination_silence_threshold options ( #646 )
2024-02-20 17:34:54 +01:00
Jordi Mas
6ffcbdfbc2
Fix typos in README.md ( #668 )
2024-02-20 17:33:17 +01:00
Purfview
52695567c9
Bumps up PyAV version to support Python 3.12.x ( #679 )
2024-02-20 17:31:07 +01:00
IlianP
c6b28ed3a0
Update README.md ( #685 )
...
I'm surprised that WhisperX hasn't made it into this list yet, as it has more stars than faster-whisper itself 🚀
2024-02-20 17:28:00 +01:00
trungkienbkhn
4ab646035f
Upgrade ctranslate2 version to support CUDA 12 ( #694 )
2024-02-20 17:26:55 +01:00
Purfview
f144e4c83d
Expands the note for distil-whisper ( #659 )
2024-01-28 21:48:40 +01:00
Purfview
3aec421849
Add: More clarity of what "max_new_tokens" does ( #658 )
...
* Add: More clarity of what "max_new_tokens" does
2024-01-28 21:40:33 +01:00
Dominik Macháček
64b9f244bd
Whisper-Streaming mention ( #656 )
...
under community integrations
2024-01-25 18:27:27 +01:00
Purfview
00efce1696
Bugfix: Illogical "Avoid computing higher temperatures on no_speech" ( #652 )
2024-01-24 11:54:43 +01:00
metame
ad3c83045b
support distil-whisper ( #557 )
2024-01-24 10:17:12 +01:00
Jürgen Fleiß
72ff979a2e
Add GUI faster-whisper project README.md ( #554 )
...
Added aTrain GUI faster-whisper transcription and diarization tool as community project.
Co-authored-by: JuergenFleiss <118339672+Juergen-J-F@users.noreply.github.com >
2024-01-18 13:01:02 +01:00
makaveli
615de0d2d9
add WhisperLive to community integration ( #647 )
2024-01-18 12:54:14 +01:00
Purfview
44f7e58947
Update whisper-standalone-win description in README.md ( #508 )
...
* Update whisper-standalone-win description in README.md
2023-12-14 13:03:46 +01:00
Purfview
ebcfd6b964
Fix broken prompt_reset_on_temperature ( #604 )
...
* Fix broken prompt_reset_on_temperature
Fixing: https://github.com/SYSTRAN/faster-whisper/issues/603
Broken because `generate_with_fallback()` doesn't return final temperature.
Regression since PR356 -> https://github.com/SYSTRAN/faster-whisper/pull/356
2023-12-13 13:14:39 +01:00
trungkienbkhn
19329a3611
Word timing tweaks ( #616 )
2023-12-13 12:38:44 +01:00
Purfview
65094b779e
Update info on cuBLAS and cuDNN libs in README.md ( #513 )
2023-11-27 12:12:47 +01:00
Clayton Yochum
9641d5f56a
Force read-mode in av.open ( #566 )
...
The `av.open` functions checks input metadata to determine the mode to open with ("r" or "w"). If an input to `decode_audio` is found to be in write-mode, without this change it can't be read. Forcing read mode solves this.
2023-11-27 10:43:35 +01:00
Dang Chuan Nguyen
e1a218fab1
Bump version to 0.10.0
2023-11-24 23:19:47 +01:00
Oscaarjs
3084409633
Add V3 Support ( #578 )
...
* Add V3 Support
* update conversion example
---------
Co-authored-by: oscaarjs <oscar.johansson@conversy.se >
2023-11-24 23:16:12 +01:00
Guillaume Klein
5a0541ea7d
Bump version to 0.9.0
2023-09-18 16:21:37 +02:00
Guillaume Klein
e94711bb5c
Add property WhisperModel.supported_languages ( #476 )
...
* Expose function supported_languages
* Make it a method
2023-09-14 17:42:02 +02:00
Guillaume Klein
0048844f54
Expose function available_models ( #475 )
...
* Expose function available_models
* Add test case
2023-09-14 17:17:01 +02:00
Guillaume Klein
a49097e655
Add some missing typing annotations in transcribe.py
2023-09-12 15:45:54 +02:00
Guillaume Klein
81086f6d33
Always run the encoder at the beginning of the loop ( #468 )
2023-09-12 14:44:37 +02:00
Guillaume Klein
f697945691
Update tokenizers requirement to include version 0.14 ( #469 )
2023-09-12 14:44:22 +02:00
Guillaume Klein
727ab81f31
Improve error message for invalid task and language parameters ( #466 )
2023-09-12 10:02:23 +02:00
Guillaume Klein
0285d46f6f
Add more details about the requirements in the README ( #463 )
2023-09-08 14:35:17 +02:00
Guillaume Klein
ad388cd394
Bump version to 0.8.0
2023-09-04 11:56:48 +02:00
Guillaume Klein
4a41746e55
Log a warning when the model is English-only but the language is set to something else ( #454 )
2023-09-04 11:55:40 +02:00
Guillaume Klein
1e6eb967c9
Add "large" alias for "large-v2" model ( #453 )
2023-09-04 11:54:42 +02:00
Guillaume Klein
f0ff12965a
Expose generation parameter no_repeat_ngram_size ( #449 )
2023-09-01 17:31:30 +02:00
Guillaume Klein
5871858a5f
Force the garbage collector to run after decoding the audio with PyAV ( #448 )
2023-09-01 15:25:13 +02:00