Commit Graph

192 Commits

Author SHA1 Message Date
KH
1a1eb1a027 Add clear_previous_text_on_temperature parameter (#397)
* Add clear_previous_text_on_temperature parameter

* Add a description
2023-08-03 15:40:58 +02:00
Guillaume Klein
5c17de1771 Bump version to 0.7.1 2023-07-24 11:10:12 +02:00
Guillaume Klein
0f55c436fe Invalidate the cached encoder output when no_speech threshold is met (#376) 2023-07-24 10:57:15 +02:00
KH
e786e26f75 Return result with best log prob when all temperature fallbacks failed (#356)
* Resolve Inference Selection Bug

* Refactor for better readability

* Filter out results with compression_ratio

* Refactor to avoid variable repetition

* Fix incorrect index and perform minor refactoring

* Remove final_temperature variable
2023-07-20 16:13:11 +02:00
KH
687db319e0 Remove duplicate code (#359) 2023-07-18 16:03:01 +02:00
Guillaume Klein
171d90dd1f Bump version to 0.7.0 2023-07-18 15:23:47 +02:00
Guillaume Klein
0e051a5b77 Prepend prefix tokens with the initial timestamp token (#358) 2023-07-18 15:22:39 +02:00
Guillaume Klein
2a37390fed Minor reformatting in code snippet 2023-07-18 15:08:53 +02:00
Hoon
3b4a6aa1c2 Improve timestamp heuristics (#336)
* Improve timestamp heuristics

* Chore
2023-07-05 15:16:53 +02:00
zh-plus
c7cb2aa8d4 Add support for using whisper models from Huggingface by specifying the model id. (#334)
* Add support for downloading CTranslate-converted models from Huggingface.

* Update utils.py to pass Flake8.

* Update utils.py to pass black.

* Remove redundant usage instructions.

* Apply suggestions from code review

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>

---------

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-07-03 17:40:10 +02:00
Guillaume Klein
c0d93d0829 Avoid computing higher temperatures on no_speech segments (#225)
Port commit e334ff141d
2023-07-03 10:20:36 +02:00
Guillaume Klein
19c294f978 Squash long words at window and sentence boundaries (#226)
Port commit 255887f219
2023-07-03 10:20:20 +02:00
FlippFuzz
fee52c9229 Allow users to input an Iterable of token ids into initial_prompt (#306)
* Allow users to input an Iterable of token ids into initial_prompt

* Need to check for String first because string is also an Iterable
2023-06-21 14:46:20 +02:00
Guillaume Klein
efc4f61d85 Do not specify the vocabulary file extension in the download pattern (#311) 2023-06-20 10:53:11 +02:00
kh
ad58ba26ab Fix typo (#304)
https://github.com/snakers4/silero-vad/discussions/319#discussion-5081706
2023-06-16 07:37:45 +02:00
zh-plus
20d4e9418b Add Open-Lyrics as a community project. (#291) 2023-06-10 08:22:29 +02:00
Antonio Zarauz Moreno
d4222da952 Update README with community repo using FW (#284)
* Update README with community repo using FW

* Minor formatting change

---------

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-06-07 11:30:53 +02:00
Guillaume Klein
1bb7e33b93 Reformat code snippet in README 2023-05-24 18:22:44 +02:00
Guillaume Klein
2a00621564 Bump version to 0.6.0 2023-05-24 16:15:01 +02:00
Guillaume Klein
a150adcc19 Enable onnxruntime dependency for Python 3.11 (#260) 2023-05-24 16:07:54 +02:00
Guillaume Klein
ae1e6d9883 Remove reference to the VAD function from the README 2023-05-24 15:56:21 +02:00
Guillaume Klein
cf7c021573 Export __version__ at the module level (#258) 2023-05-24 15:50:37 +02:00
Guillaume Klein
4db549b800 Make get_speech_timestamps backward compatible with the previous usage (#259) 2023-05-24 15:49:36 +02:00
Guillaume Klein
c99feb22dc Include requirements files in sdist (#240) 2023-05-24 12:55:15 +02:00
Guillaume Klein
723cb97483 Fix occasional IndexError on empty segments (#227) 2023-05-24 12:55:04 +02:00
Guillaume Klein
6a2da9a95c Also catch client-side network exceptions when synchronizing models (#228) 2023-05-11 15:07:15 +02:00
Guillaume Klein
6a1d331d66 Add CONTRIBUTING.md (#229) 2023-05-11 15:06:46 +02:00
Guillaume Klein
2d7c984bfc Reformat function download_model for clarity 2023-05-11 14:47:22 +02:00
Guillaume Klein
8e5c747ab5 Reformat list of community integrations 2023-05-11 12:15:41 +02:00
Purfview
32b962bed8 Adds: whisper-standalone-win (#216) 2023-05-09 20:20:41 +02:00
David Axelrod
53d247b0bb retry model download locally if huggingface throws an http error. (#215)
* rety model download locally if huggingface throws an http error.

* appease the linter

* key error fix

* use non internal lib error

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>

---------

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-05-09 17:20:22 +02:00
Ozan Caglayan
91f948b0d6 transcribe: return all language probabilities if requested (#210)
* transcribe: return all language probabilities if requested

If return_all_language_probs is True, TranscriptionInfo structure
will have a list of tuples reflecting all language probabilities
as returned by the model.

* transcribe: fix docstring

* transcribe: remove return_all_lang_probs parameter
2023-05-09 14:53:47 +02:00
FlippFuzz
5d8f3e2d90 Implement VadOptions (#198)
* Implement VadOptions

* Fix line too long

./faster_whisper/transcribe.py:226:101: E501 line too long (111 > 100 characters)

* Reformatted files with black

* black .\faster_whisper\vad.py    
* black .\faster_whisper\transcribe.py

* Fix import order with isort

* isort .\faster_whisper\vad.py
* isort .\faster_whisper\transcribe.py

* Made recommended changes

Recommended in https://github.com/guillaumekln/faster-whisper/pull/198

* Fix typing of vad_options argument

---------

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-05-09 12:47:02 +02:00
Mahmoud Ashraf
d889345e07 added whisper-diarize (#193) 2023-04-28 10:56:13 +02:00
Jordi Mas
5d203d2757 Update Github link to community project (#187) 2023-04-27 14:53:28 +02:00
Guillaume Klein
a3dcb90081 Bump version to 0.5.1 2023-04-26 17:38:16 +02:00
Guillaume Klein
89a4c7f1f0 Update docstring to clarify download_root and output_dir 2023-04-26 17:37:51 +02:00
Guillaume Klein
6f9d68dd6b Fix typing of local_files_only 2023-04-26 17:36:24 +02:00
Jordi Mas
68df3214ba Use cache_dir instead of local_dir (#182)
* Use cache_dir instead of local_dir

* Fix unit test

* Use cache_dir and preserve local_dir parameter

* Remove blank line at the end

* Disable ut

* Implement  download_root suggestion

* Use cache_dir=download_root
2023-04-26 16:35:18 +02:00
Guillaume Klein
67cce3f552 Bump version to 0.5.0 2023-04-25 17:00:41 +02:00
Guillaume Klein
8340e04dc6 Assign words to the speech chunk with the greatest coverage (#180) 2023-04-25 15:54:31 +02:00
Guillaume Klein
8cf5d5a4b3 Increase the default value of speech_pad_ms to 400 ms (#179) 2023-04-25 15:54:22 +02:00
Guillaume Klein
32dc625f11 Update README.md 2023-04-25 15:47:38 +02:00
Guillaume Klein
e06511f96b Rename AudioInfo to TranscriptionInfo (#174) 2023-04-24 16:29:17 +02:00
Anthony
338a725ff8 fix where the tokens are reset (#175) 2023-04-24 16:28:47 +02:00
Amar Sood
f893113759 Align segment structure with openai/whisper (#154)
* Align segment structure with openai/whisper

* Update code to apply requested changes

* Move increment below the segment filtering

---------

Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-04-24 15:04:42 +02:00
FlippFuzz
2b51a97e61 Add transcription_options to AudioInfo (#170)
* Add transcription_options to AudioInfo

It would be great if we can include the transcription_options in AudioInfo.

My application is only making a few changes but leaving the rest as default.
However, I would like to record down all settings (including those that I did not specify) so that the audio can be transcribed again identically in future if need be.

* Make TranscriptionOptions appear before AudioInfo

* Remove unnecessary whitespace
2023-04-24 15:02:19 +02:00
Jordi Mas
358d373691 Allow specifying local_files_only to prevent checking the Internet everytime (#166) 2023-04-20 14:26:06 +02:00
Guillaume Klein
3adcc12d0f Clarify that the returned segments value is a generator (#144)
* Clarify that the returned segments value is a generator

* Update README.md
2023-04-13 09:50:53 +02:00
Ewald Enzinger
2b53dee6b6 Expose download location in WhisperModel constructor (#126)
This increases compatibility with OpenAI Whisper's whisper.load_model() and is useful for downstream integrations
2023-04-08 10:02:36 +02:00