File size: 489 Bytes
b9f719c
b459d0c
 
 
 
 
 
 
 
 
 
 
 
 
3ed322a
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
This model is funetune version of Codebert in roberta. On CodeSearchNet.
###
Quick start:

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("addy88/programming-lang-identifier")

model = AutoModelForSequenceClassification.from_pretrained("addy88/programming-lang-identifier")

input_ids = tokenizer.encode(CODE_TO_IDENTIFY)
logits = model(input_ids)[0]

language_idx = logits.argmax() # index for the resulting label
###