README.md · faster-whisper-tiny

README.md

1.9 KB · 141 lines · markdown Raw

1	`---`
2	`language:`
3	`- en`
4	`- zh`
5	`- de`
6	`- es`
7	`- ru`
8	`- ko`
9	`- fr`
10	`- ja`
11	`- pt`
12	`- tr`
13	`- pl`
14	`- ca`
15	`- nl`
16	`- ar`
17	`- sv`
18	`- it`
19	`- id`
20	`- hi`
21	`- fi`
22	`- vi`
23	`- he`
24	`- uk`
25	`- el`
26	`- ms`
27	`- cs`
28	`- ro`
29	`- da`
30	`- hu`
31	`- ta`
32	`- 'no'`
33	`- th`
34	`- ur`
35	`- hr`
36	`- bg`
37	`- lt`
38	`- la`
39	`- mi`
40	`- ml`
41	`- cy`
42	`- sk`
43	`- te`
44	`- fa`
45	`- lv`
46	`- bn`
47	`- sr`
48	`- az`
49	`- sl`
50	`- kn`
51	`- et`
52	`- mk`
53	`- br`
54	`- eu`
55	`- is`
56	`- hy`
57	`- ne`
58	`- mn`
59	`- bs`
60	`- kk`
61	`- sq`
62	`- sw`
63	`- gl`
64	`- mr`
65	`- pa`
66	`- si`
67	`- km`
68	`- sn`
69	`- yo`
70	`- so`
71	`- af`
72	`- oc`
73	`- ka`
74	`- be`
75	`- tg`
76	`- sd`
77	`- gu`
78	`- am`
79	`- yi`
80	`- lo`
81	`- uz`
82	`- fo`
83	`- ht`
84	`- ps`
85	`- tk`
86	`- nn`
87	`- mt`
88	`- sa`
89	`- lb`
90	`- my`
91	`- bo`
92	`- tl`
93	`- mg`
94	`- as`
95	`- tt`
96	`- haw`
97	`- ln`
98	`- ha`
99	`- ba`
100	`- jw`
101	`- su`
102	`tags:`
103	`- audio`
104	`- automatic-speech-recognition`
105	`license: mit`
106	`library_name: ctranslate2`
107	`---`
108
109	`# Whisper tiny model for CTranslate2`
110
111	`This repository contains the conversion of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) to the [CTranslate2](https://github.com/OpenNMT/CTranslate2) model format.`
112
113	`This model can be used in CTranslate2 or projects based on CTranslate2 such as [faster-whisper](https://github.com/systran/faster-whisper).`
114
115	`## Example`
116
117	```python
118	`from faster_whisper import WhisperModel`
119
120	`model = WhisperModel("tiny")`
121
122	`segments, info = model.transcribe("audio.mp3")`
123	`for segment in segments:`
124	`print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))`
125	```
126
127	`## Conversion details`
128
129	`The original model was converted with the following command:`
130
131	```
132	`ct2-transformers-converter --model openai/whisper-tiny --output_dir faster-whisper-tiny \`
133	`--copy_files tokenizer.json --quantization float16`
134	```
135
136	Note that the model weights are saved in FP16. This type can be changed when the model is loaded using the [`compute_type` option in CTranslate2](https://opennmt.net/CTranslate2/quantization.html).
137
138	`## More information`
139
140	`For more information about the original model, see its [model card](https://huggingface.co/openai/whisper-tiny).`
141