I am delighted to share my recent project, GECKO, a bilingual large language model for Korean and English. This initiative was inspired by the lack of resources for Korean large language models.
@donggyukimc and I wrote the technical report to share our insights and experiences from developing the model. While it may not achieve state-of-the-art performance on all benchmarks, it shows solid results given the relatively small number of pretraining tokens.
I hope GECKO contributes to the open-source community, offering resources that can be built upon and improved. I believe that through collaboration and shared knowledge, we can advance the capabilities and accessibility of large language models for Korean and other low-resource languages.