Guillaume Klein
a49097e655
Add some missing typing annotations in transcribe.py
2023-09-12 15:45:54 +02:00
Guillaume Klein
81086f6d33
Always run the encoder at the beginning of the loop (#468)
2023-09-12 14:44:37 +02:00
Guillaume Klein
f697945691
Update tokenizers requirement to include version 0.14 (#469)
2023-09-12 14:44:22 +02:00
Guillaume Klein
727ab81f31
Improve error message for invalid task and language parameters (#466)
2023-09-12 10:02:23 +02:00
Guillaume Klein
0285d46f6f
Add more details about the requirements in the README (#463)
2023-09-08 14:35:17 +02:00
Guillaume Klein
ad388cd394
Bump version to 0.8.0
2023-09-04 11:56:48 +02:00
Guillaume Klein
4a41746e55
Log a warning when the model is English-only but the language is set to something else (#454)
2023-09-04 11:55:40 +02:00
Guillaume Klein
1e6eb967c9
Add "large" alias for "large-v2" model (#453)
2023-09-04 11:54:42 +02:00
Guillaume Klein
f0ff12965a
Expose generation parameter no_repeat_ngram_size (#449)
2023-09-01 17:31:30 +02:00
Guillaume Klein
5871858a5f
Force the garbage collector to run after decoding the audio with PyAV (#448)
2023-09-01 15:25:13 +02:00
MinorJinx
e87fbf8a49
Added audio duration after VAD to TranscriptionInfo object (#445)
* Added VAD removed audio duration to TranscriptionInfo object
Along with the duration of the original audio, this commit adds the seconds of audio removed by the VAD to the returned info object.
* Changing naming for duration_after_vad
Instead of the property returning the audio duration removed, it now returns the final duration after the VAD.
If vad_filter is False or if it doesn't remove any audio, the original duration is returned.
2023-08-31 17:19:48 +02:00
Hrishikesh Barman
7b271da035
docs: add wscribe to community integrations (#427)
wscribe is a utility that generates transcripts designed to be easy
to edit manually afterwards with the accompanying wscribe-editor
2023-08-17 08:50:24 +02:00
Aisu Wata
1562b02345
added repetition_penalty to TranscriptionOptions (#403)
Co-authored-by: Aisu Wata <aisu.wata0@gmail.com>
2023-08-06 10:08:24 +02:00
Purfview
1ce16652ee
Adds DEBUG log message for prompt_reset_on_temperature (#399)
Produce DEBUG log message if prompt_reset_on_temperature threshold is met.
2023-08-04 09:06:17 +02:00
Purfview
857be6f621
Rename clear_previous_text_on_temperature argument (#398)
The name `prompt_reset_on_temperature` makes it clearer what the argument does.
2023-08-03 18:44:37 +02:00
KH
1a1eb1a027
Add clear_previous_text_on_temperature parameter (#397)
* Add clear_previous_text_on_temperature parameter
* Add a description
2023-08-03 15:40:58 +02:00
Guillaume Klein
5c17de1771
Bump version to 0.7.1
2023-07-24 11:10:12 +02:00
Guillaume Klein
0f55c436fe
Invalidate the cached encoder output when no_speech threshold is met (#376)
2023-07-24 10:57:15 +02:00
KH
e786e26f75
Return result with best log prob when all temperature fallbacks failed (#356)
* Resolve Inference Selection Bug
* Refactor for better readability
* Filter out results with compression_ratio
* Refactor to avoid variable repetition
* Fix incorrect index and perform minor refactoring
* Remove final_temperature variable
2023-07-20 16:13:11 +02:00
KH
687db319e0
Remove duplicate code (#359)
2023-07-18 16:03:01 +02:00
Guillaume Klein
171d90dd1f
Bump version to 0.7.0
2023-07-18 15:23:47 +02:00
Guillaume Klein
0e051a5b77
Prepend prefix tokens with the initial timestamp token (#358)
2023-07-18 15:22:39 +02:00
Guillaume Klein
2a37390fed
Minor reformatting in code snippet
2023-07-18 15:08:53 +02:00
Hoon
3b4a6aa1c2
Improve timestamp heuristics (#336)
* Improve timestamp heuristics
* Chore
2023-07-05 15:16:53 +02:00
zh-plus
c7cb2aa8d4
Add support for using whisper models from Huggingface by specifying the model id. (#334)
* Add support for downloading CTranslate-converted models from Huggingface.
* Update utils.py to pass Flake8.
* Update utils.py to pass black.
* Remove redundant usage instructions.
* Apply suggestions from code review
Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
---------
Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-07-03 17:40:10 +02:00
Guillaume Klein
c0d93d0829
Avoid computing higher temperatures on no_speech segments (#225)
Port commit e334ff141d
2023-07-03 10:20:36 +02:00
Guillaume Klein
19c294f978
Squash long words at window and sentence boundaries (#226)
Port commit 255887f219
2023-07-03 10:20:20 +02:00
FlippFuzz
fee52c9229
Allow users to input an Iterable of token ids into initial_prompt (#306)
* Allow users to input an Iterable of token ids into initial_prompt
* Need to check for String first because string is also an Iterable
2023-06-21 14:46:20 +02:00
Guillaume Klein
efc4f61d85
Do not specify the vocabulary file extension in the download pattern (#311)
2023-06-20 10:53:11 +02:00
kh
ad58ba26ab
Fix typo (#304)
https://github.com/snakers4/silero-vad/discussions/319#discussion-5081706
2023-06-16 07:37:45 +02:00
zh-plus
20d4e9418b
Add Open-Lyrics as a community project. (#291)
2023-06-10 08:22:29 +02:00
Antonio Zarauz Moreno
d4222da952
Update README with community repo using FW (#284)
* Update README with community repo using FW
* Minor formatting change
---------
Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-06-07 11:30:53 +02:00
Guillaume Klein
1bb7e33b93
Reformat code snippet in README
2023-05-24 18:22:44 +02:00
Guillaume Klein
2a00621564
Bump version to 0.6.0
2023-05-24 16:15:01 +02:00
Guillaume Klein
a150adcc19
Enable onnxruntime dependency for Python 3.11 (#260)
2023-05-24 16:07:54 +02:00
Guillaume Klein
ae1e6d9883
Remove reference to the VAD function from the README
2023-05-24 15:56:21 +02:00
Guillaume Klein
cf7c021573
Export __version__ at the module level (#258)
2023-05-24 15:50:37 +02:00
Guillaume Klein
4db549b800
Make get_speech_timestamps backward compatible with the previous usage (#259)
2023-05-24 15:49:36 +02:00
Guillaume Klein
c99feb22dc
Include requirements files in sdist (#240)
2023-05-24 12:55:15 +02:00
Guillaume Klein
723cb97483
Fix occasional IndexError on empty segments (#227)
2023-05-24 12:55:04 +02:00
Guillaume Klein
6a2da9a95c
Also catch client-side network exceptions when synchronizing models (#228)
2023-05-11 15:07:15 +02:00
Guillaume Klein
6a1d331d66
Add CONTRIBUTING.md (#229)
2023-05-11 15:06:46 +02:00
Guillaume Klein
2d7c984bfc
Reformat function download_model for clarity
2023-05-11 14:47:22 +02:00
Guillaume Klein
8e5c747ab5
Reformat list of community integrations
2023-05-11 12:15:41 +02:00
Purfview
32b962bed8
Adds: whisper-standalone-win (#216)
2023-05-09 20:20:41 +02:00
David Axelrod
53d247b0bb
retry model download locally if huggingface throws an http error. (#215)
* retry model download locally if huggingface throws an http error.
* appease the linter
* key error fix
* use non internal lib error
Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
---------
Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-05-09 17:20:22 +02:00
Ozan Caglayan
91f948b0d6
transcribe: return all language probabilities if requested (#210)
* transcribe: return all language probabilities if requested
If return_all_language_probs is True, TranscriptionInfo structure
will have a list of tuples reflecting all language probabilities
as returned by the model.
* transcribe: fix docstring
* transcribe: remove return_all_lang_probs parameter
2023-05-09 14:53:47 +02:00
FlippFuzz
5d8f3e2d90
Implement VadOptions (#198)
* Implement VadOptions
* Fix line too long
./faster_whisper/transcribe.py:226:101: E501 line too long (111 > 100 characters)
* Reformatted files with black
* black .\faster_whisper\vad.py
* black .\faster_whisper\transcribe.py
* Fix import order with isort
* isort .\faster_whisper\vad.py
* isort .\faster_whisper\transcribe.py
* Made recommended changes
Recommended in https://github.com/guillaumekln/faster-whisper/pull/198
* Fix typing of vad_options argument
---------
Co-authored-by: Guillaume Klein <guillaumekln@users.noreply.github.com>
2023-05-09 12:47:02 +02:00
Mahmoud Ashraf
d889345e07
added whisper-diarize (#193)
2023-04-28 10:56:13 +02:00
Jordi Mas
5d203d2757
Update Github link to community project (#187)
2023-04-27 14:53:28 +02:00