faster-whisper

Author	SHA1	Message	Date
Guillaume Klein	0224400584	Add large-v1 model	2023-03-28 14:36:10 +02:00
Guillaume Klein	8246479fda	Ignore the invalid audio frames (#82)	2023-03-27 10:19:22 +02:00
Guillaume Klein	e2705d11c9	Raise an explicit error message if the model size is invalid	2023-03-26 16:30:00 +02:00
Jordi Mas	f8d2fb169f	Fix variable name reference (#77)	2023-03-25 10:00:59 +01:00
Guillaume Klein	a10732c74a	Only download the required model files	2023-03-24 17:59:11 +01:00
Guillaume Klein	7808eddf06	Bump version to 0.3.0	2023-03-24 10:56:42 +01:00
Guillaume Klein	de7682a2f0	Automatically download converted models from the Hugging Face Hub (#70): Remove unused import; Remove non needed requirements in dev mode; Remove extra index URL when pip install in CI; Allow downloading to a specific directory; Update docstring; Add argument to disable the progress bars; Fix typo in docstring	2023-03-24 10:55:55 +01:00
Guillaume Klein	523ae2180f	Run the encoder only once for each 30-second window (#73)	2023-03-24 10:53:49 +01:00
Guillaume Klein	2b7be47041	Update README.md	2023-03-24 09:15:05 +01:00
Guillaume Klein	3f02c53610	Add .gitignore file	2023-03-23 20:52:46 +01:00
Guillaume Klein	e663186a4b	Add some badges at the top of the README	2023-03-23 20:33:19 +01:00
Guillaume Klein	e44a8c7ba0	Update the README following the PyPI release	2023-03-22 21:07:27 +01:00
Guillaume Klein	33f41d84e3	Add job to push a package for each new Git tag	2023-03-22 21:01:53 +01:00
Guillaume Klein	c910ec0293	Bump version to 0.2.0	2023-03-22 20:54:07 +01:00
Guillaume Klein	e9dfe23eaa	Complete the package metadata	2023-03-22 20:53:51 +01:00
Guillaume Klein	66efd02bd0	Run some automatic tests with GitHub Actions (#68)	2023-03-22 20:50:03 +01:00
Guillaume Klein	52264f2277	Fix typing for device_index argument	2023-03-22 13:51:12 +01:00
Guillaume Klein	c27c010f96	Ignore Unicode errors in input file metadata	2023-03-21 17:13:37 +01:00
Guillaume Klein	0ab8db2b37	Remove debug prints	2023-03-18 09:48:02 +01:00
Guillaume Klein	a70aac18ae	Remove unused import	2023-03-18 09:47:02 +01:00
Guillaume Klein	d82be59d5f	Fix unset attribute when using English-only models	2023-03-17 18:33:16 +01:00
Guillaume Klein	58f4447964	Update benchmark results with latest openai/whisper and faster-whisper	2023-03-17 16:44:07 +01:00
Guillaume Klein	cce6b53e45	Fix incorrect attribute access	2023-03-16 10:32:36 +01:00
Guillaume Klein	2007adf0b5	Fix typing of words attribute	2023-03-15 17:49:07 +01:00
Guillaume Klein	ae9898f0d8	Include duration in AudioInfo structure	2023-03-15 15:30:29 +01:00
Guillaume Klein	c5f6b91b7d	Port utility function format_timestamp	2023-03-15 15:27:20 +01:00
Guillaume Klein	eafb2c79a3	Add more typing annotations	2023-03-15 15:22:53 +01:00
Guillaume Klein	8bd013ea99	Add word-level timestamps (#43): Fix alignment between the segments and the lists of words; Fix truncated words list when the replacement character is decoded; Check for empty text_tokens; Add usage example in the readme; Update ctranslate2 to 3.9; Skip empty segment; Set typing for the new methods	2023-03-15 15:02:28 +01:00
Guillaume Klein	b41fd05948	Update python_requires to >=3.8	2023-03-10 11:15:58 +01:00
Guillaume Klein	3301dd9273	Make get_input a free function	2023-03-09 12:54:41 +01:00
Guillaume Klein	c52adaca90	Create a helper class Tokenizer	2023-03-09 12:53:49 +01:00
Guillaume Klein	f0a21ea916	Use a dict to represent intermediate segments	2023-03-09 11:53:55 +01:00
Guillaume Klein	6a84df400f	Fix all_tokens handling. See `38f2f4d99d`	2023-03-09 10:02:25 +01:00
Guillaume Klein	4176da0d68	Rename offset to seek to match the OpenAI implementation	2023-03-09 09:58:58 +01:00
Guillaume Klein	6b16b8a69c	Pad the audio instead of the spectrogram. See `919a713499`	2023-03-08 10:50:46 +01:00
Guillaume Klein	2646906596	Fix error in decode_audio for long audio inputs	2023-03-07 10:15:36 +01:00
Guillaume Klein	01ef12a6a0	Do not ignore last segment ending with one timestamp. See `eab8d920ed`	2023-03-07 10:05:04 +01:00
Guillaume Klein	469244a57d	Update CTranslate2 to 3.8.0	2023-03-06 16:21:48 +01:00
Guillaume Klein	4a18adc382	Load the tokenizer from the model directory if it exists	2023-03-01 15:47:16 +01:00
Guillaume Klein	873992623c	Accept the audio waveform as an input to transcribe() (#21)	2023-02-28 19:01:31 +01:00
Guillaume Klein	ed32002aea	Add instructions to install without git clone	2023-02-27 12:21:54 +01:00
Guillaume Klein	a4f1cc8f11	Add prefix parameter	2023-02-27 12:09:40 +01:00
Guillaume Klein	528aa3e784	Make threshold parameters optional	2023-02-27 11:32:03 +01:00
Guillaume Klein	f0add58bdc	Add typing to constructor and transcribe method	2023-02-27 11:22:02 +01:00
Guillaume Klein	b1c69927f8	Update code snippet to be consistent with the conversion example	2023-02-24 15:52:23 +01:00
Guillaume Klein	ef71be09ed	Update CTranslate2 to 3.7.0	2023-02-23 11:18:58 +01:00
Guillaume Klein	f5c0e44935	Update README.md	2023-02-22 14:59:29 +01:00
Guillaume Klein	d91365e321	Minor code simplification	2023-02-22 11:02:11 +01:00
Guillaume Klein	4b8237da1b	Strip the leading space before computing the compression ratio	2023-02-22 10:28:04 +01:00
Guillaume Klein	e47e00910a	Add length_penalty parameter and correctly compute the avg log prob	2023-02-22 10:27:38 +01:00

224 Commits