README.md
| 1 | --- |
| 2 | language: |
| 3 | - en |
| 4 | - zh |
| 5 | - de |
| 6 | - es |
| 7 | - ru |
| 8 | - ko |
| 9 | - fr |
| 10 | - ja |
| 11 | - pt |
| 12 | - tr |
| 13 | - pl |
| 14 | - ca |
| 15 | - nl |
| 16 | - ar |
| 17 | - sv |
| 18 | - it |
| 19 | - id |
| 20 | - hi |
| 21 | - fi |
| 22 | - vi |
| 23 | - he |
| 24 | - uk |
| 25 | - el |
| 26 | - ms |
| 27 | - cs |
| 28 | - ro |
| 29 | - da |
| 30 | - hu |
| 31 | - ta |
| 32 | - 'no' |
| 33 | - th |
| 34 | - ur |
| 35 | - hr |
| 36 | - bg |
| 37 | - lt |
| 38 | - la |
| 39 | - mi |
| 40 | - ml |
| 41 | - cy |
| 42 | - sk |
| 43 | - te |
| 44 | - fa |
| 45 | - lv |
| 46 | - bn |
| 47 | - sr |
| 48 | - az |
| 49 | - sl |
| 50 | - kn |
| 51 | - et |
| 52 | - mk |
| 53 | - br |
| 54 | - eu |
| 55 | - is |
| 56 | - hy |
| 57 | - ne |
| 58 | - mn |
| 59 | - bs |
| 60 | - kk |
| 61 | - sq |
| 62 | - sw |
| 63 | - gl |
| 64 | - mr |
| 65 | - pa |
| 66 | - si |
| 67 | - km |
| 68 | - sn |
| 69 | - yo |
| 70 | - so |
| 71 | - af |
| 72 | - oc |
| 73 | - ka |
| 74 | - be |
| 75 | - tg |
| 76 | - sd |
| 77 | - gu |
| 78 | - am |
| 79 | - yi |
| 80 | - lo |
| 81 | - uz |
| 82 | - fo |
| 83 | - ht |
| 84 | - ps |
| 85 | - tk |
| 86 | - nn |
| 87 | - mt |
| 88 | - sa |
| 89 | - lb |
| 90 | - my |
| 91 | - bo |
| 92 | - tl |
| 93 | - mg |
| 94 | - as |
| 95 | - tt |
| 96 | - haw |
| 97 | - ln |
| 98 | - ha |
| 99 | - ba |
| 100 | - jw |
| 101 | - su |
| 102 | tags: |
| 103 | - audio |
| 104 | - automatic-speech-recognition |
| 105 | license: mit |
| 106 | library_name: ctranslate2 |
| 107 | --- |
| 108 | |
| 109 | # Whisper tiny model for CTranslate2 |
| 110 | |
| 111 | This repository contains the conversion of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) to the [CTranslate2](https://github.com/OpenNMT/CTranslate2) model format. |
| 112 | |
| 113 | This model can be used in CTranslate2 or projects based on CTranslate2 such as [faster-whisper](https://github.com/systran/faster-whisper). |
| 114 | |
| 115 | ## Example |
| 116 | |
| 117 | ```python |
| 118 | from faster_whisper import WhisperModel |
| 119 | |
| 120 | model = WhisperModel("tiny") |
| 121 | |
| 122 | segments, info = model.transcribe("audio.mp3") |
| 123 | for segment in segments: |
| 124 | print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text)) |
| 125 | ``` |
| 126 | |
| 127 | ## Conversion details |
| 128 | |
| 129 | The original model was converted with the following command: |
| 130 | |
| 131 | ``` |
| 132 | ct2-transformers-converter --model openai/whisper-tiny --output_dir faster-whisper-tiny \ |
| 133 | --copy_files tokenizer.json --quantization float16 |
| 134 | ``` |
| 135 | |
| 136 | Note that the model weights are saved in FP16. This type can be changed when the model is loaded using the [`compute_type` option in CTranslate2](https://opennmt.net/CTranslate2/quantization.html). |
| 137 | |
| 138 | ## More information |
| 139 | |
| 140 | **For more information about the original model, see its [model card](https://huggingface.co/openai/whisper-tiny).** |
| 141 | |