Charles Koutcheme's picture

11 79

Charles Koutcheme PRO

koutch

·

https://koutche.me

AI & ML interests

Source code modelling using large language models

Recent Activity

liked a model about 1 month ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

View all activity

Organizations

koutch's activity

upvoted a paper 2 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 167

upvoted a collection 7 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 25 days ago • 351

upvoted a paper 8 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 68

upvoted a paper 9 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 91

upvoted 2 papers 10 months ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 36

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20 • 47

upvoted a paper 11 months ago

RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture

Paper • 2401.08406 • Published Jan 16 • 37

upvoted 4 papers about 1 year ago

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 33

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 70

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 53