Sergei Averkiev

averoo

https://lingtra.in

averkij

AI & ML interests

None yet

Recent Activity

liked a Space about 3 hours ago

multimodalart/LLaDA

upvoted a paper 7 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

upvoted a paper 7 days ago

SurveyX: Academic Survey Automation via Large Language Models

averoo's activity

liked a Space about 3 hours ago

LLaDA

🚀

Large Language Diffusion Models

upvoted 2 papers 7 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 11 days ago • 155

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 11 days ago • 90

upvoted 2 papers 10 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 11 days ago • 168

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 11 days ago • 92

upvoted a paper 17 days ago

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Paper • 2502.09082 • Published 18 days ago • 27

liked a dataset 18 days ago

data-for-agents/insta-150k

Viewer • Updated 3 days ago • 147k • 448 • 5

liked a model 18 days ago

nomic-ai/nomic-embed-text-v2-moe

upvoted a paper 27 days ago

Improving Transformer World Models for Data-Efficient RL

Paper • 2502.01591 • Published 28 days ago • 9

upvoted 2 papers about 1 month ago

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published Jan 30 • 22

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

Paper • 2501.17433 • Published Jan 29 • 9

liked a model about 1 month ago

lingtrain/labse-udmurt

updated a model about 1 month ago

lingtrain/labse-udmurt

upvoted a paper about 1 month ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 26

liked a model about 1 month ago

baichuan-inc/Baichuan-Omni-1d5

Updated 23 days ago • 381 • 38

upvoted 3 papers about 1 month ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 61

Evolution and The Knightian Blindspot of Machine Learning

Paper • 2501.13075 • Published Jan 22 • 6

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published Jan 22 • 68

reacted to singhsidhukuldeep's post with 👍 about 1 month ago

Post

Exciting breakthrough in Retrieval-Augmented Generation (RAG): Introducing MiniRAG - a revolutionary approach that makes RAG systems accessible for edge devices and resource-constrained environments.

Key innovations that set MiniRAG apart:

Semantic-aware Heterogeneous Graph Indexing
- Combines text chunks and named entities in a unified structure
- Reduces reliance on complex semantic understanding
- Creates rich semantic networks for precise information retrieval

Lightweight Topology-Enhanced Retrieval
- Leverages graph structures for efficient knowledge discovery
- Uses pattern matching and localized text processing
- Implements query-guided reasoning path discovery
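As a rough illustration of the two ideas above, here is a minimal sketch of a graph that links text chunks to the entities they mention and then retrieves chunks by following those links from the query. This is not the MiniRAG implementation: the class `HeteroGraphIndex`, its methods, the capitalized-word entity extractor, and the 1.0 / 0.5 scoring weights are all hypothetical stand-ins chosen for illustration.

```python
import re
from collections import defaultdict


class HeteroGraphIndex:
    """Toy graph with two node types: text chunks and entities (hypothetical)."""

    def __init__(self):
        self.chunks = {}                           # chunk id -> chunk text
        self.entity_to_chunks = defaultdict(set)   # entity -> chunk ids mentioning it
        self.chunk_to_entities = defaultdict(set)  # chunk id -> entities it mentions

    @staticmethod
    def extract_entities(text):
        # Assumption: capitalized words stand in for a real entity extractor.
        return {m.lower() for m in re.findall(r"\b[A-Z][a-zA-Z]+\b", text)}

    def add_chunk(self, chunk_id, text):
        self.chunks[chunk_id] = text
        for entity in self.extract_entities(text):
            self.entity_to_chunks[entity].add(chunk_id)
            self.chunk_to_entities[chunk_id].add(entity)

    def retrieve(self, query, top_k=3):
        # Seed entities: query tokens that match entity nodes (plain pattern matching).
        tokens = set(re.findall(r"[a-z]+", query.lower()))
        seeds = {t for t in tokens if t in self.entity_to_chunks}
        scores = defaultdict(float)
        for entity in seeds:
            for cid in self.entity_to_chunks[entity]:
                scores[cid] += 1.0  # chunk mentions a query entity directly
                # One-hop expansion: chunks that share another entity with this chunk.
                for other in self.chunk_to_entities[cid] - {entity}:
                    for neighbor in self.entity_to_chunks[other]:
                        if neighbor != cid:
                            scores[neighbor] += 0.5
        # Highest score first; ties broken by chunk id for a stable order.
        return sorted(scores, key=lambda c: (-scores[c], c))[:top_k]


index = HeteroGraphIndex()
index.add_chunk("c1", "Marie Curie discovered polonium in Paris.")
index.add_chunk("c2", "Paris hosts the Sorbonne, where Curie later taught.")
index.add_chunk("c3", "Students at the Sorbonne study physics.")
index.add_chunk("c4", "Bananas grow in tropical climates.")

print(index.retrieve("where did curie work?"))  # ['c1', 'c2', 'c3']
```

In this sketch, chunk `c3` never mentions the query entity but is still reached through the shared "sorbonne" node, which is the kind of path a graph structure can surface without a large model interpreting the query.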

Impressive Performance Metrics
- Achieves comparable results to LLM-based methods while using Small Language Models (SLMs)
- Requires only 25% of storage space compared to existing solutions
- Maintains robust performance with accuracy reduction ranging from just 0.8% to 20%

The researchers, from the University of Hong Kong, have also contributed a comprehensive benchmark dataset designed for evaluating lightweight RAG systems under realistic on-device scenarios.

This breakthrough opens new possibilities for:
- Edge device AI applications
- Privacy-sensitive implementations
- Real-time processing systems
- Resource-constrained environments

The full implementation and datasets are available on GitHub: HKUDS/MiniRAG

liked a dataset about 1 month ago

alexantonov/chukot_russian_flores_sample

Viewer • Updated about 1 month ago • 100 • 122 • 4