Latxa: An Open Language Model and Evaluation Suite for Basque

HiTZ zentroa (non-profit)
AI & ML interests: Natural Language Processing, Signal Processing
Organization Card
HiTZ is a reference center on Language Technologies whose aim is to promote research, training, technology transfer and innovation in Artificial Intelligence focused on language and speech. It is a multidisciplinary team of people with backgrounds in computer science, linguistics and engineering, drawn from the Ixa and Aholab research groups, both at the University of the Basque Country (UPV/EHU). Since their creation in 1988 and 1995 respectively, these groups have been the main driving forces in Language Technologies in the Basque Country.
Collections: 22
Spaces: 3
Models (71)

HiTZ/BERnaT-base (Fill-Mask)
HiTZ/BERnaT-medium-NERC (Token Classification): 26 downloads
HiTZ/BERnaT-base-NERC (Token Classification): 30 downloads
HiTZ/BERnaT-large-NERC (Token Classification): 22 downloads
HiTZ/Latxa-Llama-3.1-8B-Instruct (Text Generation): 397 downloads, 7 likes
HiTZ/Qwen2.5-14B-Instruct_ODESIA (Text Generation): 15 downloads
HiTZ/pyannote-segmentation-3.0-RTVE (Automatic Speech Recognition): 16 downloads
HiTZ/judge-eus (Text Generation): 4.38k downloads, 2 likes
HiTZ/gemma-2b-it_ODESIA (Text Generation): 40 downloads
HiTZ/Hermes-3-Llama-3.1-8B_ODESIA (Text Generation): 19 downloads
Datasets (42)

HiTZ/laion-eus: 221k rows, 37 downloads, 1 like
HiTZ/wnli-eu: 217 rows, 220 downloads
HiTZ/XCOPA-eu: 600 rows, 187 downloads
HiTZ/PIQA-eu: 1.84k rows, 158 downloads
HiTZ/MGSM-eu: 258 rows, 296 downloads
HiTZ/ARC-eu: 4.42k rows, 251 downloads
HiTZ/PAWS-eu: 2k rows, 97 downloads
HiTZ/EN2CS: 12.5k rows, 70 downloads
HiTZ/composite_corpus_es_v1.0: 526k rows, 497 downloads
HiTZ/composite_corpus_eu_v2.1: 407k rows, 133 downloads