update inference with transformers

#162

transforming the list of tokens into a pytorch tensor and adding device map to load from disk to gpu directly

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment