Upload 9 files

Files changed (8)

README.md +129 -0
added_tokens.json +104 -0
config.json +114 -0
generation_config.json +186 -0
preprocessor_config.json +111 -0
sentencepiece.bpe.model +3 -0
special_tokens_map.json +112 -0
tokenizer_config.json +938 -0

README.md ADDED

	@@ -0,0 +1,129 @@

+---
+inference: false
+tags:
+- SeamlessM4T
+- seamless_m4t
+license: cc-by-nc-4.0
+library_name: transformers
+pipeline_tag: text-to-speech
+---
+# SeamlessM4T Large
+SeamlessM4T is a collection of models designed to provide high quality translation, allowing people from different
+linguistic communities to communicate effortlessly through speech and text.
+This repository hosts 🤗 Hugging Face's [implementation](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t) of SeamlessM4T.
+-------------------
+**🌟 SeamlessM4T v2, an improved version of this model with a novel architecture, has been released [here](https://huggingface.co/facebook/seamless-m4t-v2-large).
+This new model improves over SeamlessM4T v1 in quality as well as inference speed in speech generation tasks.**
+**SeamlessM4T v2 is also supported by 🤗 Transformers, more on it [in the model card of this new version](https://huggingface.co/facebook/seamless-m4t-v2-large#transformers-usage) or directly in [🤗 Transformers docs](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t_v2).**
+-------------------
+SeamlessM4T Large covers:
+- 📥 101 languages for speech input
+- ⌨️ [96 languages](https://huggingface.co/ylacombe/hf-seamless-m4t-large/blob/main/generation_config.json#L48-L145) for text input/output
+- 🗣️ [35 languages](https://huggingface.co/ylacombe/hf-seamless-m4t-large/blob/main/generation_config.json#L149-L184) for speech output.
+This is the "large" variant of the unified model, which enables multiple tasks without relying on multiple separate models:
+- Speech-to-speech translation (S2ST)
+- Speech-to-text translation (S2TT)
+- Text-to-speech translation (T2ST)
+- Text-to-text translation (T2TT)
+- Automatic speech recognition (ASR)
+You can perform all the above tasks from one single model, [`SeamlessM4TModel`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TModel), but each task also has its own dedicated sub-model.
+## 🤗 Usage
+First, load the processor and a checkpoint of the model:
+```python
+>>> from transformers import AutoProcessor, SeamlessM4TModel
+>>> processor = AutoProcessor.from_pretrained("facebook/hf-seamless-m4t-large")
+>>> model = SeamlessM4TModel.from_pretrained("facebook/hf-seamless-m4t-large")
+```
+You can seamlessly use this model on text or on audio, to generate either translated text or translated audio.
+Here is how to use the processor to process text and audio:
+```python
+>>> # let's load an audio sample from an Arabic speech corpus
+>>> from datasets import load_dataset
+>>> dataset = load_dataset("arabic_speech_corpus", split="test", streaming=True)
+>>> audio_sample = next(iter(dataset))["audio"]
+>>> # now, process it
+>>> audio_inputs = processor(audios=audio_sample["array"], return_tensors="pt")
+>>> # now, process some English text as well
+>>> text_inputs = processor(text="Hello, my dog is cute", src_lang="eng", return_tensors="pt")
+```
+### Speech
+[`SeamlessM4TModel`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TModel) can *seamlessly* generate text or speech with few or no changes. Let's target Russian voice translation:
+```python
+>>> audio_array_from_text = model.generate(**text_inputs, tgt_lang="rus")[0].cpu().numpy().squeeze()
+>>> audio_array_from_audio = model.generate(**audio_inputs, tgt_lang="rus")[0].cpu().numpy().squeeze()
+```
+With basically the same code, we have translated English text and Arabic speech into Russian speech samples.
+### Text
+Similarly, you can generate translated text from audio files or from text with the same model. You only have to pass `generate_speech=False` to [`SeamlessM4TModel.generate`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TModel.generate).
+This time, let's translate to French.
+```python
+>>> # from audio
+>>> output_tokens = model.generate(**audio_inputs, tgt_lang="fra", generate_speech=False)
+>>> translated_text_from_audio = processor.decode(output_tokens[0].tolist(), skip_special_tokens=True)
+>>> # from text
+>>> output_tokens = model.generate(**text_inputs, tgt_lang="fra", generate_speech=False)
+>>> translated_text_from_text = processor.decode(output_tokens[0].tolist(), skip_special_tokens=True)
+```
+### Tips
+#### 1. Use dedicated models
+[`SeamlessM4TModel`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TModel) is the 🤗 Transformers top-level model for generating speech and text, but you can also use dedicated models that perform a single task without the additional components, thus reducing the memory footprint.
+For example, you can replace the audio-to-audio generation snippet with the model dedicated to the S2ST task; the rest of the code stays exactly the same:
+```python
+>>> from transformers import SeamlessM4TForSpeechToSpeech
+>>> model = SeamlessM4TForSpeechToSpeech.from_pretrained("facebook/hf-seamless-m4t-large")
+```
+Or you can replace the text-to-text generation snippet with the model dedicated to the T2TT task; you only have to remove `generate_speech=False`.
+```python
+>>> from transformers import SeamlessM4TForTextToText
+>>> model = SeamlessM4TForTextToText.from_pretrained("facebook/hf-seamless-m4t-large")
+```
+Feel free to try out [`SeamlessM4TForSpeechToText`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TForSpeechToText) and [`SeamlessM4TForTextToSpeech`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TForTextToSpeech) as well.
+#### 2. Change the speaker identity
+You can change the speaker used for speech synthesis with the `spkr_id` argument. Some `spkr_id` values work better than others for some languages!
+#### 3. Change the generation strategy
+You can use different [generation strategies](https://huggingface.co/docs/transformers/v4.34.1/en/generation_strategies#text-generation-strategies) for speech and text generation, e.g., `.generate(input_ids=input_ids, text_num_beams=4, speech_do_sample=True)`, which will successively perform beam-search decoding on the text model and multinomial sampling on the speech model.
+#### 4. Generate speech and text at the same time
+Use `return_intermediate_token_ids=True` with [`SeamlessM4TModel`](https://huggingface.co/docs/transformers/main/en/model_doc/seamless_m4t#transformers.SeamlessM4TModel) to return both speech and text!
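
Tip 3 in the README above relies on a prefix convention: `text_`-prefixed arguments go to the text model, `speech_`-prefixed ones go to the speech model, and, per the 🤗 Transformers documentation for `generate`, unprefixed arguments reach both, with the prefixed form taking priority. The sketch below only illustrates that routing rule; `split_generation_kwargs` is a hypothetical helper written for this note, not the library's implementation.

```python
def split_generation_kwargs(kwargs):
    """Route keyword arguments to the text and speech sub-models by prefix.

    Hypothetical helper that mimics the documented convention; it is not
    the code used inside SeamlessM4TModel.generate.
    """
    text_kwargs, speech_kwargs = {}, {}

    # Unprefixed arguments apply to both sub-models.
    for key, value in kwargs.items():
        if not key.startswith(("text_", "speech_")):
            text_kwargs[key] = value
            speech_kwargs[key] = value

    # Prefixed arguments are applied afterwards, so they take priority.
    for key, value in kwargs.items():
        if key.startswith("text_"):
            text_kwargs[key[len("text_"):]] = value
        elif key.startswith("speech_"):
            speech_kwargs[key[len("speech_"):]] = value

    return text_kwargs, speech_kwargs


text_kwargs, speech_kwargs = split_generation_kwargs(
    {"num_beams": 1, "text_num_beams": 4, "speech_do_sample": True}
)
print(text_kwargs)    # arguments the text model would receive
print(speech_kwargs)  # arguments the speech model would receive
```

In this example the text model ends up with beam search (`num_beams=4`) while the speech model keeps `num_beams=1` and turns sampling on.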

added_tokens.json ADDED

	@@ -0,0 +1,104 @@

+{
+  "</s>": 3,
+  "<pad>": 0,
+  "<s>": 2,
+  "<unk>": 1,
+  "__afr__": 256001,
+  "__amh__": 256002,
+  "__arb__": 256003,
+  "__ary__": 256004,
+  "__arz__": 256005,
+  "__asm__": 256006,
+  "__azj__": 256007,
+  "__bel__": 256008,
+  "__ben__": 256009,
+  "__bos__": 256010,
+  "__bul__": 256011,
+  "__cat__": 256012,
+  "__ceb__": 256013,
+  "__ces__": 256014,
+  "__ckb__": 256015,
+  "__cmn_Hant__": 256017,
+  "__cmn__": 256016,
+  "__cym__": 256018,
+  "__dan__": 256019,
+  "__deu__": 256020,
+  "__ell__": 256021,
+  "__eng__": 256022,
+  "__est__": 256023,
+  "__eus__": 256024,
+  "__fin__": 256025,
+  "__fra__": 256026,
+  "__fuv__": 256027,
+  "__gaz__": 256028,
+  "__gle__": 256029,
+  "__glg__": 256030,
+  "__guj__": 256031,
+  "__heb__": 256032,
+  "__hin__": 256033,
+  "__hrv__": 256034,
+  "__hun__": 256035,
+  "__hye__": 256036,
+  "__ibo__": 256037,
+  "__ind__": 256038,
+  "__isl__": 256039,
+  "__ita__": 256040,
+  "__jav__": 256041,
+  "__jpn__": 256042,
+  "__kan__": 256043,
+  "__kat__": 256044,
+  "__kaz__": 256045,
+  "__khk__": 256046,
+  "__khm__": 256047,
+  "__kir__": 256048,
+  "__kor__": 256049,
+  "__lao__": 256050,
+  "__lit__": 256051,
+  "__lug__": 256052,
+  "__luo__": 256053,
+  "__lvs__": 256054,
+  "__mai__": 256055,
+  "__mal__": 256056,
+  "__mar__": 256057,
+  "__mkd__": 256058,
+  "__mlt__": 256059,
+  "__mni__": 256060,
+  "__mya__": 256061,
+  "__nld__": 256062,
+  "__nno__": 256063,
+  "__nob__": 256064,
+  "__npi__": 256065,
+  "__nya__": 256066,
+  "__ory__": 256067,
+  "__pan__": 256068,
+  "__pbt__": 256069,
+  "__pes__": 256070,
+  "__pol__": 256071,
+  "__por__": 256072,
+  "__ron__": 256073,
+  "__rus__": 256074,
+  "__sat__": 256075,
+  "__slk__": 256076,
+  "__slv__": 256077,
+  "__sna__": 256078,
+  "__snd__": 256079,
+  "__som__": 256080,
+  "__spa__": 256081,
+  "__srp__": 256082,
+  "__swe__": 256083,
+  "__swh__": 256084,
+  "__tam__": 256085,
+  "__tel__": 256086,
+  "__tgk__": 256087,
+  "__tgl__": 256088,
+  "__tha__": 256089,
+  "__tur__": 256090,
+  "__ukr__": 256091,
+  "__urd__": 256092,
+  "__uzn__": 256093,
+  "__vie__": 256094,
+  "__yor__": 256095,
+  "__yue__": 256096,
+  "__zlm__": 256097,
+  "__zul__": 256098
+}
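
The language tokens in added_tokens.json follow a simple pattern: a three-letter code (optionally with a script suffix such as `Hant`) wrapped in double underscores. The sketch below shows how a language code could be turned into its token and looked up; `ADDED_TOKENS` is a hand-copied excerpt of the file above, and `lang_token` and `lang_token_id` are hypothetical helper names, not part of the tokenizer's API.

```python
# Excerpt of added_tokens.json, copied from the file above.
ADDED_TOKENS = {
    "__cmn__": 256016,
    "__cmn_Hant__": 256017,
    "__eng__": 256022,
    "__fra__": 256026,
    "__rus__": 256074,
}


def lang_token(lang_code):
    """Wrap a language code in the double-underscore token format."""
    return f"__{lang_code}__"


def lang_token_id(lang_code, added_tokens=ADDED_TOKENS):
    """Return the vocabulary id of a language token, or raise KeyError."""
    token = lang_token(lang_code)
    if token not in added_tokens:
        raise KeyError(f"no language token {token!r} in this excerpt")
    return added_tokens[token]


print(lang_token("cmn_Hant"), lang_token_id("cmn_Hant"))
print(lang_token("fra"), lang_token_id("fra"))
```

The ids found this way are the same values that generation_config.json lists under `text_decoder_lang_to_code_id` for the bare language codes.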

config.json ADDED

	@@ -0,0 +1,114 @@

+{
+  "activation_dropout": 0.0,
+  "activation_function": "relu",
+  "adaptor_dropout": 0.1,
+  "adaptor_kernel_size": 8,
+  "adaptor_stride": 8,
+  "add_adapter": true,
+  "architectures": [
+    "SeamlessM4TModel"
+  ],
+  "attention_dropout": 0.1,
+  "bos_token_id": 2,
+  "conv_depthwise_kernel_size": 31,
+  "decoder_attention_heads": 16,
+  "decoder_ffn_dim": 8192,
+  "decoder_layerdrop": 0.05,
+  "decoder_layers": 24,
+  "decoder_start_token_id": 3,
+  "dropout": 0.1,
+  "encoder_attention_heads": 16,
+  "encoder_ffn_dim": 8192,
+  "encoder_layerdrop": 0.05,
+  "encoder_layers": 24,
+  "eos_token_id": 3,
+  "feature_projection_input_dim": 160,
+  "hidden_size": 1024,
+  "initializer_range": 0.02,
+  "is_encoder_decoder": true,
+  "lang_embed_dim": 256,
+  "layer_norm_eps": 1e-05,
+  "leaky_relu_slope": 0.1,
+  "max_new_tokens": 256,
+  "max_position_embeddings": 1024,
+  "max_source_positions": 4096,
+  "model_type": "seamless_m4t",
+  "num_adapter_layers": 1,
+  "num_attention_heads": 16,
+  "num_conv_pos_embedding_groups": 16,
+  "num_conv_pos_embeddings": 128,
+  "num_hidden_layers": 24,
+  "pad_token_id": 0,
+  "position_embeddings_type": "relative",
+  "resblock_dilation_sizes": [
+    [
+      1,
+      3,
+      5
+    ],
+    [
+      1,
+      3,
+      5
+    ],
+    [
+      1,
+      3,
+      5
+    ]
+  ],
+  "resblock_kernel_sizes": [
+    3,
+    7,
+    11
+  ],
+  "rotary_embedding_base": 10000,
+  "sampling_rate": 16000,
+  "scale_embedding": true,
+  "speech_encoder_attention_heads": 16,
+  "speech_encoder_dropout": 0.0,
+  "speech_encoder_hidden_act": "swish",
+  "speech_encoder_intermediate_size": 4096,
+  "speech_encoder_layerdrop": 0.1,
+  "speech_encoder_layers": 24,
+  "spkr_embed_dim": 256,
+  "t2u_bos_token_id": 0,
+  "t2u_decoder_attention_heads": 16,
+  "t2u_decoder_ffn_dim": 8192,
+  "t2u_decoder_layers": 6,
+  "t2u_decoder_start_token_id": 2,
+  "t2u_encoder_attention_heads": 16,
+  "t2u_encoder_ffn_dim": 8192,
+  "t2u_encoder_layers": 6,
+  "t2u_eos_token_id": 2,
+  "t2u_max_new_tokens": 1024,
+  "t2u_max_position_embeddings": 2048,
+  "t2u_pad_token_id": 1,
+  "t2u_vocab_size": 10082,
+  "torch_dtype": "float32",
+  "transformers_version": "4.35.0.dev0",
+  "unit_embed_dim": 1280,
+  "unit_hifi_gan_vocab_size": 10000,
+  "upsample_initial_channel": 512,
+  "upsample_kernel_sizes": [
+    11,
+    8,
+    8,
+    4,
+    4
+  ],
+  "upsample_rates": [
+    5,
+    4,
+    4,
+    2,
+    2
+  ],
+  "use_cache": true,
+  "var_pred_dropout": 0.5,
+  "variance_predictor_kernel_size": 3,
+  "vocab_size": 256102,
+  "vocoder_num_langs": 36,
+  "vocoder_num_spkrs": 200,
+  "vocoder_offset": 4
+}
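
A few derived quantities follow from the values in config.json above. The sketch below computes the per-head dimension and the vocoder's total upsampling factor. It assumes the `upsample_rates` stack behaves like a standard HiFi-GAN generator, where each stage multiplies the sequence length by its rate, and it ignores any duration-based repetition of units before the vocoder. The `config` dict is a hand-copied excerpt.

```python
import math

# Excerpt of config.json, copied from the file above.
config = {
    "hidden_size": 1024,
    "num_attention_heads": 16,
    "sampling_rate": 16000,
    "upsample_rates": [5, 4, 4, 2, 2],
}

# Dimension of each attention head.
head_dim = config["hidden_size"] // config["num_attention_heads"]

# Assumption: each upsampling stage multiplies the length by its rate,
# so one vocoder input frame expands to the product of all rates.
samples_per_unit = math.prod(config["upsample_rates"])

# Duration of audio covered by one input frame, in milliseconds.
ms_per_unit = 1000 * samples_per_unit / config["sampling_rate"]

print(head_dim, samples_per_unit, ms_per_unit)
```

Under these assumptions, one vocoder input frame corresponds to 320 output samples, or 20 ms of audio at 16 kHz.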

generation_config.json ADDED

	@@ -0,0 +1,186 @@

+{
+  "bos_token_id": 2,
+  "decoder_start_token_id": 3,
+  "eos_token_id": 3,
+  "max_new_tokens": 256,
+  "pad_token_id": 0,
+  "t2u_lang_code_to_id": {
+    "arb": 10043,
+    "ben": 10044,
+    "cat": 10045,
+    "ces": 10046,
+    "cmn": 10047,
+    "cym": 10048,
+    "dan": 10049,
+    "deu": 10050,
+    "eng": 10051,
+    "est": 10052,
+    "fin": 10053,
+    "fra": 10054,
+    "hin": 10055,
+    "ind": 10056,
+    "ita": 10057,
+    "jpn": 10058,
+    "kan": 10059,
+    "kor": 10060,
+    "mlt": 10061,
+    "nld": 10062,
+    "pes": 10063,
+    "pol": 10064,
+    "por": 10065,
+    "ron": 10066,
+    "rus": 10067,
+    "slk": 10068,
+    "spa": 10069,
+    "swe": 10070,
+    "swh": 10071,
+    "tam": 10072,
+    "tel": 10073,
+    "tgl": 10074,
+    "tha": 10075,
+    "tur": 10076,
+    "ukr": 10077,
+    "urd": 10078,
+    "uzn": 10079,
+    "vie": 10080
+  },
+  "text_decoder_lang_to_code_id": {
+    "afr": 256001,
+    "amh": 256002,
+    "arb": 256003,
+    "ary": 256004,
+    "arz": 256005,
+    "asm": 256006,
+    "azj": 256007,
+    "bel": 256008,
+    "ben": 256009,
+    "bos": 256010,
+    "bul": 256011,
+    "cat": 256012,
+    "ceb": 256013,
+    "ces": 256014,
+    "ckb": 256015,
+    "cmn": 256016,
+    "cmn_Hant": 256017,
+    "cym": 256018,
+    "dan": 256019,
+    "deu": 256020,
+    "ell": 256021,
+    "eng": 256022,
+    "est": 256023,
+    "eus": 256024,
+    "fin": 256025,
+    "fra": 256026,
+    "fuv": 256027,
+    "gaz": 256028,
+    "gle": 256029,
+    "glg": 256030,
+    "guj": 256031,
+    "heb": 256032,
+    "hin": 256033,
+    "hrv": 256034,
+    "hun": 256035,
+    "hye": 256036,
+    "ibo": 256037,
+    "ind": 256038,
+    "isl": 256039,
+    "ita": 256040,
+    "jav": 256041,
+    "jpn": 256042,
+    "kan": 256043,
+    "kat": 256044,
+    "kaz": 256045,
+    "khk": 256046,
+    "khm": 256047,
+    "kir": 256048,
+    "kor": 256049,
+    "lao": 256050,
+    "lit": 256051,
+    "lug": 256052,
+    "luo": 256053,
+    "lvs": 256054,
+    "mai": 256055,
+    "mal": 256056,
+    "mar": 256057,
+    "mkd": 256058,
+    "mlt": 256059,
+    "mni": 256060,
+    "mya": 256061,
+    "nld": 256062,
+    "nno": 256063,
+    "nob": 256064,
+    "npi": 256065,
+    "nya": 256066,
+    "ory": 256067,
+    "pan": 256068,
+    "pbt": 256069,
+    "pes": 256070,
+    "pol": 256071,
+    "por": 256072,
+    "ron": 256073,
+    "rus": 256074,
+    "sat": 256075,
+    "slk": 256076,
+    "slv": 256077,
+    "sna": 256078,
+    "snd": 256079,
+    "som": 256080,
+    "spa": 256081,
+    "srp": 256082,
+    "swe": 256083,
+    "swh": 256084,
+    "tam": 256085,
+    "tel": 256086,
+    "tgk": 256087,
+    "tgl": 256088,
+    "tha": 256089,
+    "tur": 256090,
+    "ukr": 256091,
+    "urd": 256092,
+    "uzn": 256093,
+    "vie": 256094,
+    "yor": 256095,
+    "yue": 256096,
+    "zlm": 256097,
+    "zul": 256098
+  },
+  "transformers_version": "4.35.0.dev0",
+  "vocoder_lang_code_to_id": {
+    "arb": 0,
+    "ben": 1,
+    "cat": 2,
+    "ces": 3,
+    "cmn": 4,
+    "cym": 5,
+    "dan": 6,
+    "deu": 7,
+    "eng": 8,
+    "est": 9,
+    "fin": 10,
+    "fra": 11,
+    "hin": 12,
+    "ind": 13,
+    "ita": 14,
+    "jpn": 15,
+    "kor": 16,
+    "mlt": 17,
+    "nld": 18,
+    "pes": 19,
+    "pol": 20,
+    "por": 21,
+    "ron": 22,
+    "rus": 23,
+    "slk": 24,
+    "spa": 25,
+    "swe": 26,
+    "swh": 27,
+    "tel": 28,
+    "tgl": 29,
+    "tha": 30,
+    "tur": 31,
+    "ukr": 32,
+    "urd": 33,
+    "uzn": 34,
+    "vie": 35
+  }
+}
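
The three language maps in generation_config.json do not cover the same languages, so it can help to check a target language before requesting speech output. The sketch below rebuilds the `t2u_lang_code_to_id` and `vocoder_lang_code_to_id` maps from their keys (the ids in the file are consecutive, starting at 10043 and 0 respectively) and lists the codes present in the first but missing from the second. `can_generate_speech` is a hypothetical helper, and the assumption that speech output needs the language in both maps is ours.

```python
# Keys copied from generation_config.json above, in file order.
T2U_LANGS = (
    "arb ben cat ces cmn cym dan deu eng est fin fra hin ind ita jpn kan kor "
    "mlt nld pes pol por ron rus slk spa swe swh tam tel tgl tha tur ukr urd "
    "uzn vie"
).split()
VOCODER_LANGS = (
    "arb ben cat ces cmn cym dan deu eng est fin fra hin ind ita jpn kor "
    "mlt nld pes pol por ron rus slk spa swe swh tel tgl tha tur ukr urd "
    "uzn vie"
).split()

# The ids in the file are consecutive, so they can be rebuilt by position.
t2u_lang_code_to_id = {lang: 10043 + i for i, lang in enumerate(T2U_LANGS)}
vocoder_lang_code_to_id = {lang: i for i, lang in enumerate(VOCODER_LANGS)}

# Languages the text-to-unit map knows but the vocoder map does not.
speech_only_gaps = sorted(set(t2u_lang_code_to_id) - set(vocoder_lang_code_to_id))


def can_generate_speech(lang):
    """Assumption: speech output needs the language in both maps."""
    return lang in t2u_lang_code_to_id and lang in vocoder_lang_code_to_id


print(len(t2u_lang_code_to_id), len(vocoder_lang_code_to_id), speech_only_gaps)
```

With the values in this commit, the text-to-unit map has 38 entries and the vocoder map has 36; `kan` and `tam` are the two codes that appear only in the former.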

preprocessor_config.json ADDED

	@@ -0,0 +1,111 @@

+{
+  "feature_extractor_type": "SeamlessM4TFeatureExtractor",
+  "feature_size": 80,
+  "language_code": [
+    "__afr__",
+    "__amh__",
+    "__arb__",
+    "__ary__",
+    "__arz__",
+    "__asm__",
+    "__azj__",
+    "__bel__",
+    "__ben__",
+    "__bos__",
+    "__bul__",
+    "__cat__",
+    "__ceb__",
+    "__ces__",
+    "__ckb__",
+    "__cmn__",
+    "__cmn_Hant__",
+    "__cym__",
+    "__dan__",
+    "__deu__",
+    "__ell__",
+    "__eng__",
+    "__est__",
+    "__eus__",
+    "__fin__",
+    "__fra__",
+    "__fuv__",
+    "__gaz__",
+    "__gle__",
+    "__glg__",
+    "__guj__",
+    "__heb__",
+    "__hin__",
+    "__hrv__",
+    "__hun__",
+    "__hye__",
+    "__ibo__",
+    "__ind__",
+    "__isl__",
+    "__ita__",
+    "__jav__",
+    "__jpn__",
+    "__kan__",
+    "__kat__",
+    "__kaz__",
+    "__khk__",
+    "__khm__",
+    "__kir__",
+    "__kor__",
+    "__lao__",
+    "__lit__",
+    "__lug__",
+    "__luo__",
+    "__lvs__",
+    "__mai__",
+    "__mal__",
+    "__mar__",
+    "__mkd__",
+    "__mlt__",
+    "__mni__",
+    "__mya__",
+    "__nld__",
+    "__nno__",
+    "__nob__",
+    "__npi__",
+    "__nya__",
+    "__ory__",
+    "__pan__",
+    "__pbt__",
+    "__pes__",
+    "__pol__",
+    "__por__",
+    "__ron__",
+    "__rus__",
+    "__sat__",
+    "__slk__",
+    "__slv__",
+    "__sna__",
+    "__snd__",
+    "__som__",
+    "__spa__",
+    "__srp__",
+    "__swe__",
+    "__swh__",
+    "__tam__",
+    "__tel__",
+    "__tgk__",
+    "__tgl__",
+    "__tha__",
+    "__tur__",
+    "__ukr__",
+    "__urd__",
+    "__uzn__",
+    "__vie__",
+    "__yor__",
+    "__yue__",
+    "__zlm__",
+    "__zul__"
+  ],
+  "num_mel_bins": 80,
+  "padding_side": "right",
+  "padding_value": 0.0,
+  "processor_class": "SeamlessM4TProcessor",
+  "return_attention_mask": true,
+  "sampling_rate": 16000,
+  "stride": 2
+}
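
The `stride` of 2 and the 80 mel bins in preprocessor_config.json line up with `feature_projection_input_dim: 160` in config.json: stacking every two consecutive 80-dimensional frames yields 160-dimensional inputs at half the frame rate. The sketch below illustrates that stacking in plain Python. It assumes leftover frames that do not fill a full group are dropped, and `stack_frames` is a hypothetical name, not the feature extractor's own function.

```python
def stack_frames(frames, stride=2):
    """Concatenate every `stride` consecutive frames into one longer frame.

    Assumption: trailing frames that do not fill a full group are dropped.
    """
    usable = len(frames) - len(frames) % stride
    return [
        [value for frame in frames[i:i + stride] for value in frame]
        for i in range(0, usable, stride)
    ]


# Five dummy frames of 80 mel bins each; frame t is filled with the value t.
frames = [[float(t)] * 80 for t in range(5)]
stacked = stack_frames(frames, stride=2)

print(len(stacked), len(stacked[0]))
```

Here five input frames become two stacked frames of 160 values each, and the fifth frame is discarded.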

sentencepiece.bpe.model ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:026a76827537db9f1348e4d5aaa127bb10a2f2ff633243f3a52d16be82d73f9d
+size 5165809
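
The three lines above are a Git LFS pointer rather than the tokenizer model itself: they record the spec version, the SHA-256 digest of the real file, and its size in bytes. The sketch below parses such a pointer and checks a byte string against it; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and only the standard library is used.

```python
import hashlib

# Pointer text copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:026a76827537db9f1348e4d5aaa127bb10a2f2ff633243f3a52d16be82d73f9d
size 5165809
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line and decode the oid and size fields."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check that raw bytes have the size and digest the pointer records."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algorithm"], data).hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algorithm"], pointer["size"])
```

After downloading the real `sentencepiece.bpe.model`, reading it in binary mode and passing the bytes to `matches_pointer` would confirm that the download is intact.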

special_tokens_map.json ADDED

	@@ -0,0 +1,112 @@

+{
+  "additional_special_tokens": [
+    "<pad>",
+    "<unk>",
+    "<s>",
+    "</s>",
+    "__afr__",
+    "__amh__",
+    "__arb__",
+    "__ary__",
+    "__arz__",
+    "__asm__",
+    "__azj__",
+    "__bel__",
+    "__ben__",
+    "__bos__",
+    "__bul__",
+    "__cat__",
+    "__ceb__",
+    "__ces__",
+    "__ckb__",
+    "__cmn__",
+    "__cmn_Hant__",
+    "__cym__",
+    "__dan__",
+    "__deu__",
+    "__ell__",
+    "__eng__",
+    "__est__",
+    "__eus__",
+    "__fin__",
+    "__fra__",
+    "__fuv__",
+    "__gaz__",
+    "__gle__",
+    "__glg__",
+    "__guj__",
+    "__heb__",
+    "__hin__",
+    "__hrv__",
+    "__hun__",
+    "__hye__",
+    "__ibo__",
+    "__ind__",
+    "__isl__",
+    "__ita__",
+    "__jav__",
+    "__jpn__",
+    "__kan__",
+    "__kat__",
+    "__kaz__",
+    "__khk__",
+    "__khm__",
+    "__kir__",
+    "__kor__",
+    "__lao__",
+    "__lit__",
+    "__lug__",
+    "__luo__",
+    "__lvs__",
+    "__mai__",
+    "__mal__",
+    "__mar__",
+    "__mkd__",
+    "__mlt__",
+    "__mni__",
+    "__mya__",
+    "__nld__",
+    "__nno__",
+    "__nob__",
+    "__npi__",
+    "__nya__",
+    "__ory__",
+    "__pan__",
+    "__pbt__",
+    "__pes__",
+    "__pol__",
+    "__por__",
+    "__ron__",
+    "__rus__",
+    "__sat__",
+    "__slk__",
+    "__slv__",
+    "__sna__",
+    "__snd__",
+    "__som__",
+    "__spa__",
+    "__srp__",
+    "__swe__",
+    "__swh__",
+    "__tam__",
+    "__tel__",
+    "__tgk__",
+    "__tgl__",
+    "__tha__",
+    "__tur__",
+    "__ukr__",
+    "__urd__",
+    "__uzn__",
+    "__vie__",
+    "__yor__",
+    "__yue__",
+    "__zlm__",
+    "__zul__"
+  ],
+  "bos_token": "<s>",
+  "cls_token": "<s>",
+  "eos_token": "</s>",
+  "pad_token": "<pad>",
+  "sep_token": "</s>",
+  "unk_token": "<unk>"
+}

tokenizer_config.json ADDED

	@@ -0,0 +1,938 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "<pad>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "1": {
+      "content": "<unk>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "2": {
+      "content": "<s>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "3": {
+      "content": "</s>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "256001": {
+      "content": "__afr__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256002": {
+      "content": "__amh__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256003": {
+      "content": "__arb__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256004": {
+      "content": "__ary__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256005": {
+      "content": "__arz__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256006": {
+      "content": "__asm__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256007": {
+      "content": "__azj__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256008": {
+      "content": "__bel__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256009": {
+      "content": "__ben__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256010": {
+      "content": "__bos__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256011": {
+      "content": "__bul__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256012": {
+      "content": "__cat__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256013": {
+      "content": "__ceb__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256014": {
+      "content": "__ces__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256015": {
+      "content": "__ckb__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256016": {
+      "content": "__cmn__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256017": {
+      "content": "__cmn_Hant__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256018": {
+      "content": "__cym__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256019": {
+      "content": "__dan__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256020": {
+      "content": "__deu__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256021": {
+      "content": "__ell__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256022": {
+      "content": "__eng__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256023": {
+      "content": "__est__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256024": {
+      "content": "__eus__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256025": {
+      "content": "__fin__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256026": {
+      "content": "__fra__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256027": {
+      "content": "__fuv__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256028": {
+      "content": "__gaz__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256029": {
+      "content": "__gle__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256030": {
+      "content": "__glg__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256031": {
+      "content": "__guj__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256032": {
+      "content": "__heb__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256033": {
+      "content": "__hin__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256034": {
+      "content": "__hrv__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256035": {
+      "content": "__hun__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256036": {
+      "content": "__hye__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256037": {
+      "content": "__ibo__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256038": {
+      "content": "__ind__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256039": {
+      "content": "__isl__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256040": {
+      "content": "__ita__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256041": {
+      "content": "__jav__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256042": {
+      "content": "__jpn__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256043": {
+      "content": "__kan__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256044": {
+      "content": "__kat__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256045": {
+      "content": "__kaz__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256046": {
+      "content": "__khk__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256047": {
+      "content": "__khm__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256048": {
+      "content": "__kir__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256049": {
+      "content": "__kor__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256050": {
+      "content": "__lao__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256051": {
+      "content": "__lit__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256052": {
+      "content": "__lug__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256053": {
+      "content": "__luo__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256054": {
+      "content": "__lvs__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256055": {
+      "content": "__mai__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256056": {
+      "content": "__mal__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256057": {
+      "content": "__mar__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256058": {
+      "content": "__mkd__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256059": {
+      "content": "__mlt__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256060": {
+      "content": "__mni__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256061": {
+      "content": "__mya__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256062": {
+      "content": "__nld__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256063": {
+      "content": "__nno__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256064": {
+      "content": "__nob__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256065": {
+      "content": "__npi__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256066": {
+      "content": "__nya__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256067": {
+      "content": "__ory__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256068": {
+      "content": "__pan__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256069": {
+      "content": "__pbt__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256070": {
+      "content": "__pes__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256071": {
+      "content": "__pol__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256072": {
+      "content": "__por__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256073": {
+      "content": "__ron__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256074": {
+      "content": "__rus__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256075": {
+      "content": "__sat__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256076": {
+      "content": "__slk__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256077": {
+      "content": "__slv__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256078": {
+      "content": "__sna__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256079": {
+      "content": "__snd__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256080": {
+      "content": "__som__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256081": {
+      "content": "__spa__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256082": {
+      "content": "__srp__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256083": {
+      "content": "__swe__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256084": {
+      "content": "__swh__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256085": {
+      "content": "__tam__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256086": {
+      "content": "__tel__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256087": {
+      "content": "__tgk__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256088": {
+      "content": "__tgl__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256089": {
+      "content": "__tha__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256090": {
+      "content": "__tur__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256091": {
+      "content": "__ukr__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256092": {
+      "content": "__urd__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256093": {
+      "content": "__uzn__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256094": {
+      "content": "__vie__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256095": {
+      "content": "__yor__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256096": {
+      "content": "__yue__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256097": {
+      "content": "__zlm__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    },
+    "256098": {
+      "content": "__zul__",
+      "lstrip": true,
+      "normalized": false,
+      "rstrip": true,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "additional_special_tokens": [
+    "<pad>",
+    "<unk>",
+    "<s>",
+    "</s>",
+    "__afr__",
+    "__amh__",
+    "__arb__",
+    "__ary__",
+    "__arz__",
+    "__asm__",
+    "__azj__",
+    "__bel__",
+    "__ben__",
+    "__bos__",
+    "__bul__",
+    "__cat__",
+    "__ceb__",
+    "__ces__",
+    "__ckb__",
+    "__cmn__",
+    "__cmn_Hant__",
+    "__cym__",
+    "__dan__",
+    "__deu__",
+    "__ell__",
+    "__eng__",
+    "__est__",
+    "__eus__",
+    "__fin__",
+    "__fra__",
+    "__fuv__",
+    "__gaz__",
+    "__gle__",
+    "__glg__",
+    "__guj__",
+    "__heb__",
+    "__hin__",
+    "__hrv__",
+    "__hun__",
+    "__hye__",
+    "__ibo__",
+    "__ind__",
+    "__isl__",
+    "__ita__",
+    "__jav__",
+    "__jpn__",
+    "__kan__",
+    "__kat__",
+    "__kaz__",
+    "__khk__",
+    "__khm__",
+    "__kir__",
+    "__kor__",
+    "__lao__",
+    "__lit__",
+    "__lug__",
+    "__luo__",
+    "__lvs__",
+    "__mai__",
+    "__mal__",
+    "__mar__",
+    "__mkd__",
+    "__mlt__",
+    "__mni__",
+    "__mya__",
+    "__nld__",
+    "__nno__",
+    "__nob__",
+    "__npi__",
+    "__nya__",
+    "__ory__",
+    "__pan__",
+    "__pbt__",
+    "__pes__",
+    "__pol__",
+    "__por__",
+    "__ron__",
+    "__rus__",
+    "__sat__",
+    "__slk__",
+    "__slv__",
+    "__sna__",
+    "__snd__",
+    "__som__",
+    "__spa__",
+    "__srp__",
+    "__swe__",
+    "__swh__",
+    "__tam__",
+    "__tel__",
+    "__tgk__",
+    "__tgl__",
+    "__tha__",
+    "__tur__",
+    "__ukr__",
+    "__urd__",
+    "__uzn__",
+    "__vie__",
+    "__yor__",
+    "__yue__",
+    "__zlm__",
+    "__zul__"
+  ],
+  "bos_token": "<s>",
+  "clean_up_tokenization_spaces": true,
+  "cls_token": "<s>",
+  "eos_token": "</s>",
+  "model_max_length": 1000000000000000019884624838656,
+  "pad_token": "<pad>",
+  "processor_class": "SeamlessM4TProcessor",
+  "sep_token": "</s>",
+  "sp_model_kwargs": {},
+  "src_lang": "__eng__",
+  "tgt_lang": "__fra__",
+  "tokenizer_class": "SeamlessM4TTokenizer",
+  "tokenizer_file": null,
+  "unk_token": "<unk>"
+}