- Instruction Pre-Training: Language Models are Supervised Multitask Learners (Paper 2406.14491, published Jun 20, 86 upvotes)
- The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale (Paper 2406.17557, published Jun 25, 87 upvotes)
- NuminaMath Collection: datasets and models for training SOTA math LLMs; training & inference code at https://github.com/project-numina/aimo-progress-prize (6 items, updated Jul 21, 67 upvotes)
- Large Language Models Can Self-Improve in Long-context Reasoning (Paper 2411.08147, published Nov 12, 62 upvotes)
- O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? (Paper 2411.16489, published Nov 25, 40 upvotes)
- Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking (Paper 2403.09629, published Mar 14, 75 upvotes)
- Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning (Paper 2406.12050, published Jun 17, 19 upvotes)
- Top LLM Collection: a collection of top open-source LLMs, sorted best first (6 items, updated Jul 26, 13 upvotes)
- Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level (Paper 2411.03562, published Nov 5, 63 upvotes)
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective (Paper 2410.23743, published Oct 31, 59 upvotes)
- LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations (Paper 2410.02707, published Oct 3, 47 upvotes)
- Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing (Paper 2406.08464, published Jun 12, 65 upvotes)
- LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (Paper 2410.02884, published Oct 3, 52 upvotes)