LLM-jp-3 Fine-tuned Models Collection Fine-tuned models in the LLM-jp-3 model series • 5 items • Updated 5 days ago • 1
LLM-jp-3 Pre-trained Models Collection Pre-trained models in the LLM-jp-3 model series • 5 items • Updated 5 days ago • 1
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 111
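For quick reference, below is a minimal sketch of the decoding strategies the article above walks through (greedy decoding, beam search, and nucleus sampling) using the Transformers `generate` API. The `gpt2` checkpoint is only an illustrative stand-in for any causal LM, not one of the models listed here.

```python
# Minimal sketch of greedy decoding, beam search, and nucleus sampling
# with the Transformers generate API. "gpt2" is an illustrative checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("The quick brown fox", return_tensors="pt")

# Greedy decoding: always take the most probable next token.
greedy = model.generate(**inputs, max_new_tokens=20)

# Beam search: keep the num_beams highest-scoring partial sequences.
beams = model.generate(**inputs, max_new_tokens=20, num_beams=4, early_stopping=True)

# Nucleus (top-p) sampling: sample from the smallest token set
# whose cumulative probability exceeds top_p.
sampled = model.generate(**inputs, max_new_tokens=20, do_sample=True, top_p=0.92, top_k=0)

for output in (greedy, beams, sampled):
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```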
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency Paper • 2410.07563 • Published Oct 10 • 2
gemma-2-baku Collection The baku model series is based on the gemma-2 series and has been continually pre-trained on Japanese-specific corpora. • 4 items • Updated Oct 3 • 3
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated Oct 3 • 24
Japanese SimCSE Collection Tsukagoshi et al., Japanese SimCSE Technical Report, arXiv 2023. https://arxiv.org/abs/2310.19349 • 5 items • Updated Sep 4 • 2
llama-3-youko Collection The youko model series is based on the llama-3 series and has been continually pre-trained on Japanese-specific corpora. • 9 items • Updated Sep 30 • 1
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated Sep 26 • 55
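As a usage note for the quants collection above, here is a minimal sketch of loading a prequantized checkpoint with Transformers. The repository id below is illustrative, and the quantization settings are read from the repo's own config, so no extra quantization arguments are passed.

```python
# Minimal sketch, assuming a prequantized GPTQ checkpoint from the collection above.
# The repo id is illustrative; substitute the quant you actually deploy.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4"  # illustrative repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
# The quantization config ships with the repo, so the only extra step is
# mapping the weights onto the available device(s).
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize what a GPTQ quant is in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
print(tokenizer.decode(model.generate(inputs, max_new_tokens=64)[0], skip_special_tokens=True))
```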
Sarashina Collection Large Language Models developed by SB Intuitions • 7 items • Updated 12 days ago • 2
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs Paper • 2407.03963 • Published Jul 4 • 15