--- license: apache-2.0 --- This repo offers a set of fairseq roberta based models fine tuned to specific NLP tasks. These models are: - Small - Easy to host on T4 or V100 - 100x faster than using LLMs for similar tasks - Easy to fine tune All the models below were trained at Nlmatics Corp. from 2019-2023 with base model from: https://github.com/facebookresearch/fairseq/blob/main/examples/roberta/README.md ### To run the models: Use https://github.com/nlmatics/nlm-model-service ### To acccess the models Use https://github.com/nlmatics/nlm-utils ### To train the models TBD ## List of Models Click on each model to see details: ### roberta.large.boolq *Location:* [roberta.large.boolq](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.boolq) Trained with MNLI + Boolq *Trained by:* Evan Li *Application:* Given a passage and a question, answer the question with yes, no or unsure. *Training Process:* https://blogs.nlmatics.com/2020/03/12/Boolean-Question-Answering-with-Neutral-Labels.html ### roberta.large.qa See folder: [roberta.large.qa](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.qa) Trained with SQuAD 2.0 + Custom Dataset preferring shorter spans better suited for data extraction *Trained by:* Ambika Sukla *Application:* Given a passage and a question, pick the shortest span from the passage that answers the question *Training Process:* start, end location head on the top of Roberta Base ### roberta.large.stsb *See folder:* [roberta.large.stsb](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.stsb) Trained with STSB dataset *Trained by:* Meta/Fairseq *Application:* Given two passages, return a score beteen 0 and 1 to evaluate their similarity *Training Process:* regression head on top of Roberta Base ### roberta.large.phraseqa *See folder:* [roberta.large.phraseqa](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.phraseqa) Trained with Roberta 2.0 with the question words removed from the question *Trained By:* Batya Stein *Application:* Given a passage and phrase (key), extract a value from the passage *Training Process:* https://blogs.nlmatics.com/2020/08/25/Optimizing-Transformer-Q&A-Models-for-Naturalistic-Search.html ### roberta.large.qasrl *See folder:* [roberta.large.qasrl](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.qasrl) Trained with QASRL dataset *Application:* Given a passage, get back values for who, what, when, where etc. *Trained By:* Nima Sheikholeslami ### roberta.large.qatype.lower.RothWithQ *See folder:* [roberta.large.qatype.lower.RothWithQ](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.qatype.lower.RothWithQ) Trained with the Roth Question Type dataset. *Application:* Given a question, return one of the answer types e.g. number, location. See the Roth dataset for full list. *Trained By:* Evan Li ### roberta.large.io_qa See folder: [roberta.large.io_qa](https://huggingface.co/ansukla/roberta/tree/main/roberta.large.io_qa) Trained with SQuAD 2.0 dataset *Trained By:* Nima Sheikholeslami *Training Process:* Use io head to support multiple spans.