| Author | Commit | Message | Date |
|---|---|---|---|
| Guillaume Klein | f0a21ea916 | Use a dict to represent intermediate segments | 2023-03-09 11:53:55 +01:00 |
| Guillaume Klein | 6a84df400f | Fix all_tokens handling. See 38f2f4d99d | 2023-03-09 10:02:25 +01:00 |
| Guillaume Klein | 4176da0d68 | Rename offset to seek to match the OpenAI implementation | 2023-03-09 09:58:58 +01:00 |
| Guillaume Klein | 6b16b8a69c | Pad the audio instead of the spectrogram. See 919a713499 | 2023-03-08 10:50:46 +01:00 |
| Guillaume Klein | 01ef12a6a0 | Do not ignore last segment ending with one timestamp. See eab8d920ed | 2023-03-07 10:05:04 +01:00 |
| Guillaume Klein | 469244a57d | Update CTranslate2 to 3.8.0 | 2023-03-06 16:21:48 +01:00 |
| Guillaume Klein | 4a18adc382 | Load the tokenizer from the model directory if it exists | 2023-03-01 15:47:16 +01:00 |
| Guillaume Klein | 873992623c | Accept the audio waveform as an input to transcribe() (#21) | 2023-02-28 19:01:31 +01:00 |
| Guillaume Klein | a4f1cc8f11 | Add prefix parameter | 2023-02-27 12:09:40 +01:00 |
| Guillaume Klein | 528aa3e784 | Make threshold parameters optional | 2023-02-27 11:32:03 +01:00 |
| Guillaume Klein | f0add58bdc | Add typing to constructor and transcribe method | 2023-02-27 11:22:02 +01:00 |
| Guillaume Klein | ef71be09ed | Update CTranslate2 to 3.7.0 | 2023-02-23 11:18:58 +01:00 |
| Guillaume Klein | d91365e321 | Minor code simplification | 2023-02-22 11:02:11 +01:00 |
| Guillaume Klein | 4b8237da1b | Strip the leading space before computing the compression ratio | 2023-02-22 10:28:04 +01:00 |
| Guillaume Klein | e47e00910a | Add length_penalty parameter and correctly compute the avg log prob | 2023-02-22 10:27:38 +01:00 |
| Guillaume Klein | f5c9f15c2c | Check that the language code is valid | 2023-02-21 12:10:54 +01:00 |
| Guillaume Klein | e2094b6474 | Reduce the maximum length when the prompt is longer than 448/2 | 2023-02-17 14:37:24 +01:00 |
| Guillaume Klein | 123d9a5704 | Support English-only models | 2023-02-16 17:02:40 +01:00 |
| Guillaume Klein | cbbe633082 | Add num_workers parameter | 2023-02-14 09:34:05 +01:00 |
| Guillaume Klein | c86353d323 | Add task parameter | 2023-02-13 21:26:25 +01:00 |
| Guillaume Klein | f56dfc6491 | Add without_timestamps parameter | 2023-02-13 21:22:05 +01:00 |
| Guillaume Klein | 3dc44f7bb5 | Raise a more explicit error message for English-only models | 2023-02-13 18:26:45 +01:00 |
| Guillaume Klein | 269b3dfb10 | Expose the device_index argument (#5) | 2023-02-13 11:06:40 +01:00 |
| Guillaume Klein | 60e667e0d2 | Cleanup unused import | 2023-02-12 11:44:05 +01:00 |
| Guillaume Klein | 7d1d0541c8 | Add the initial_prompt parameter (#2). Add the initial_prompt parameter; add docstring | 2023-02-12 11:42:21 +01:00 |
| Guillaume Klein | 23d2d64259 | Update transcribe.py | 2023-02-11 11:47:07 +01:00 |
| Guillaume Klein | 5216d52d94 | Initial commit | 2023-02-11 10:21:19 +01:00 |