Spaces:

jesseplusplus
/

easy-translate

Running

App Files Files Community

easy-translate / examples /nllb200-moe-54B_1GPU_8bits.sh

Iker's picture

Add support for SMALL100 and update examples

911ab5a over 1 year ago

history blame contribute delete

716 Bytes

	# Run NLLB200-MOE model on sample text. This is a huge model that doesn't fit on a single GPU, so we use
	# 8-bit quantization to reduce the required VRAM. Still it might not fit on a single GPU, so we also use
	# the --force_auto_device_map flag that will offload the model parameters that don't fit on the GPU to the CPU.
	# If 8-bit quantization is not enough, you can use 4-bit quantization, see examples/nllb200-moe-54B_1GPU_4bits.sh

	python3 translate.py \
	--sentences_path sample_text/en.txt \
	--output_path sample_text/en2es.translation.nllb200-moe-54B.txt \
	--source_lang eng_Latn \
	--target_lang spa_Latn \
	--model_name facebook/nllb-moe-54b \
	--precision 8 \
	--force_auto_device_map \
	--starting_batch_size 8