runtime error
Exit code: 1. Reason: 0<00:00, 485224.98 examples/s] /usr/local/lib/python3.10/site-packages/transformers/tokenization_utils_base.py:1601: FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884 warnings.warn( Map: 0%| | 0/16853 [00:00<?, ? examples/s][A Map: 0%| | 0/16853 [00:00<?, ? examples/s] Traceback (most recent call last): File "/home/user/app/app.py", line 17, in <module> tokenized_datasets = dataset.map(tokenize_function, batched=True) File "/usr/local/lib/python3.10/site-packages/datasets/dataset_dict.py", line 869, in map { File "/usr/local/lib/python3.10/site-packages/datasets/dataset_dict.py", line 870, in <dictcomp> k: dataset.map( File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 602, in wrapper out: Union["Dataset", "DatasetDict"] = func(self, *args, **kwargs) File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 567, in wrapper out: Union["Dataset", "DatasetDict"] = func(self, *args, **kwargs) File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3167, in map for rank, done, content in Dataset._map_single(**dataset_kwargs): File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3558, in _map_single batch = apply_function_on_filtered_inputs( File "/usr/local/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3427, in apply_function_on_filtered_inputs processed_inputs = function(*fn_args, *additional_args, **fn_kwargs) File "/home/user/app/app.py", line 15, in tokenize_function return tokenizer(examples["text"], truncation=True, padding="max_length") File "/usr/local/lib/python3.10/site-packages/datasets/formatting/formatting.py", line 277, in __getitem__ value = self.data[key] KeyError: 'text'
Container logs:
Fetching error logs...