faster-whisper

Author	SHA1	Message	Date
heimoshuiyu	28a4d11a73	Revert "Merge remote-tracking branch 'upstream/master' into prompt" This reverts commit `6e42088656`, reversing changes made to `4a59bb011d`.	2024-09-12 00:49:31 +08:00
Mahmoud Ashraf	d57c5b40b0	Remove the usage of `transformers.pipeline` from `BatchedInferencePipeline` and fix word timestamps for batched inference (#921 ) * fix word timestamps for batched inference * remove hf pipeline	2024-07-27 09:02:58 +07:00
Jilt Sebastian	eb8390233c	New PR for Faster Whisper: Batching Support, Speed Boosts, and Quality Enhancements (#856 ) Batching Support, Speed Boosts, and Quality Enhancements --------- Co-authored-by: Hargun Mujral <83234565+hargunmujral@users.noreply.github.com> Co-authored-by: MahmoudAshraf97 <hassouna97.ma@gmail.com>	2024-07-18 16:48:52 +07:00
ddorian	e11d58599d	Allow av to include version 12. (#819 )	2024-05-06 08:57:35 +07:00
Keating Reid	46080e584e	Loosening tokenizers version constraint (#804 )	2024-05-04 15:10:24 +07:00
Purfview	52695567c9	Bumps up PyAV version to support Python 3.12.x (#679 )	2024-02-20 17:31:07 +01:00
trungkienbkhn	4ab646035f	Upgrade ctranslate2 version to support CUDA 12 (#694 )	2024-02-20 17:26:55 +01:00
Oscaarjs	3084409633	Add V3 Support (#578 ) * Add V3 Support * update conversion example --------- Co-authored-by: oscaarjs <oscar.johansson@conversy.se>	2023-11-24 23:16:12 +01:00
Guillaume Klein	f697945691	Update tokenizers requirement to include version 0.14 (#469 )	2023-09-12 14:44:22 +02:00
Guillaume Klein	0e051a5b77	Prepend prefix tokens with the initial timestamp token (#358 )	2023-07-18 15:22:39 +02:00
Guillaume Klein	a150adcc19	Enable onnxruntime dependency for Python 3.11 (#260 )	2023-05-24 16:07:54 +02:00
Guillaume Klein	19698c95f8	Support VAD filter (#95 ) * Support VAD filter * Generalize function collect_samples * Define AudioSegment class * Only pass prompt and prefix to the first chunk * Add dict argument vad_parameters * Fix isort format * Rename method * Update README * Add shortcut when the chunk offset is 0 * Reword readme * Fix end property * Concatenate the speech chunks * Cleanup diff * Increase default speech pad * Update README * Increase default speech pad	2023-04-03 17:22:48 +02:00
Guillaume Klein	de7682a2f0	Automatically download converted models from the Hugging Face Hub (#70 ) * Automatically download converted models from the Hugging Face Hub * Remove unused import * Remove non needed requirements in dev mode * Remove extra index URL when pip install in CI * Allow downloading to a specific directory * Update docstring * Add argument to disable the progess bars * Fix typo in docstring	2023-03-24 10:55:55 +01:00
Guillaume Klein	523ae2180f	Run the encoder only once for each 30-second window (#73 )	2023-03-24 10:53:49 +01:00
Guillaume Klein	8bd013ea99	Add word-level timestamps (#43 ) * Add word-level timestamps * Fix alignment between the segments and the lists of words * Fix truncated words list when the replacement character is decoded * Check for empty text_tokens * Add usage example in the readme * Update ctranslate2 to 3.9 * Skip empty segment * Set typing for the new methods	2023-03-15 15:02:28 +01:00
Guillaume Klein	469244a57d	Update CTranslate2 to 3.8.0	2023-03-06 16:21:48 +01:00
Guillaume Klein	ef71be09ed	Update CTranslate2 to 3.7.0	2023-02-23 11:18:58 +01:00
Guillaume Klein	cda834c8ea	Update CTranslate2 to 3.6.0	2023-02-16 17:01:19 +01:00
Guillaume Klein	5e938cba4e	Bump minimum CTranslate2 requirement to 3.5.1	2023-02-13 21:16:54 +01:00
Guillaume Klein	5216d52d94	Initial commit	2023-02-11 10:21:19 +01:00

20 Commits