6 1 109

Alex

AlexPoto

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

unsloth/QwQ-32B-GGUF

new activity 13 days ago

blues-alex/YandexGPT-5-Lite-8B-pretrain-Q4_K_M-GGUF:Q8?

reacted to Kseniase's post with 🚀 28 days ago

View all activity

Organizations

None yet

AlexPoto's activity

liked a model 3 days ago

unsloth/QwQ-32B-GGUF

Text Generation • Updated about 6 hours ago • 49.9k • 48

New activity in blues-alex/YandexGPT-5-Lite-8B-pretrain-Q4_K_M-GGUF 13 days ago

Q8?

#1 opened 13 days ago by

AlexPoto

reacted to Kseniase's post with 🚀 28 days ago

Post

7779

8 New Types of RAG

RAG techniques continuously evolve to enhance LLM response accuracy by retrieving relevant external data during generation. To keep up with current AI trends, new RAG types incorporate deep step-by-step reasoning, tree search, citations, multimodality and other effective techniques.

Here's a list of 8 latest RAG advancements:

1. DeepRAG -> DeepRAG: Thinking to Retrieval Step by Step for Large Language Models (2502.01142)
Models retrieval-augmented reasoning as a Markov Decision Process, enabling strategic retrieval. It dynamically decides when to retrieve external knowledge and when rely on parametric reasoning.

2. RealRAG -> RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning (2502.00848)
Enhances novel object generation by retrieving real-world images and using self-reflective contrastive learning to fill knowledge gap, improve realism and reduce distortions.

3. Chain-of-Retrieval Augmented Generation (CoRAG) -> Chain-of-Retrieval Augmented Generation (2501.14342)
Retrieves information step-by-step and adjusts it, also deciding how much compute power to use at test time. If needed it reformulates queries.

4. VideoRAG -> VideoRAG: Retrieval-Augmented Generation over Video Corpus (2501.05874)
Enables unlimited-length video processing, using dual-channel architecture that integrates graph-based textual grounding and multi-modal context encoding.

5. CFT-RAG -> CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter (2501.15098)
A tree-RAG acceleration method uses an improved Cuckoo Filter to optimize entity localization, enabling faster retrieval.

6. Contextualized Graph RAG (CG-RAG) -> CG-RAG: Research Question Answering by Citation Graph Retrieval-Augmented LLMs (2501.15067)
Uses Lexical-Semantic Graph Retrieval (LeSeGR) to integrate sparse and dense signals within graph structure and capture citation relationships

7. GFM-RAG -> GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation (2502.01113)
A graph foundation model that uses a graph neural network to refine query-knowledge connections

8. URAG -> URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT (2501.16276)
A hybrid system combining rule-based and RAG methods to improve lightweight LLMs for educational chatbots

1 reply

liked a dataset about 1 month ago

kristaller486/Nebo-T1-Russian

Viewer • Updated Feb 2 • 16.4k • 536 • 13

reacted to kristaller486's post with 🚀 about 1 month ago

Post

1393

Nebo-T1-Russian

(Probably) the first "longCoT" dataset for the Russian language created via Deeseek-R1.

- Prompts taken from the Sky-T1 dataset and translated via Llama3.3-70B.
- Answers and reasoning generated by Deepseek-R1 (685B).
- 16.4K samples in total, ≈12.4K Russian-only (in the rest, either the answer or reasoning is in English).
- Languages in the answers and reasoning are labeled using fasttext.

kristaller486/Nebo-T1-Russian

liked a model about 1 month ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated Jan 30 • 26.7k • 226

liked 2 Spaces about 1 month ago

130

Qwen2.5 VL 72B Instruct

💻

Interact with Qwen2.5-VL-Chat model using text and files

1.87k

Chat With Janus-Pro-7B

🌍

A unified multimodal understanding and generation model.

liked 2 models about 1 month ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 301k • 3.2k

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 3 days ago • 268k • 362

New activity in benxh/Qwen2.5-VL-7B-Instruct-GGUF about 1 month ago

Wrong format?

#1 opened about 1 month ago by

AlexPoto

liked a model about 1 month ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 4 days ago • 3.41M • 644

liked 2 models about 2 months ago

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

Updated Jan 25 • 440k • 122

bartowski/Qwen2-VL-72B-Instruct-GGUF

Image-Text-to-Text • Updated Dec 18, 2024 • 3.56k • 11

reacted to nyuuzyou's post with 🤗 about 2 months ago

Post

1506

🗂️ I don't think the collections feature of Hugging Face is widely used, even though it's an excellent way to organize and discover interesting resources. To do my bit to change that, I've created two carefully curated collections that combine both my original work and other valuable datasets:

Educational Datasets
- Mostly English-Russian, but other languages are also included
- Extended by my new Begemot.ai dataset (2.7M+ Russian education records) nyuuzyou/begemot

Link: nyuuzyou/educational-datasets-677c268978ac1cec96cc3605

Anime & Art

- Extensive art-focused collection, including my new datasets:
- Buzzly.art (2K artworks) nyuuzyou/buzzlyart
- Paintberri (60K+ pieces) nyuuzyou/paintberri
- Itaku.ee (924K+ items) nyuuzyou/itaku
- Extended with other amazing datasets from the community

Link: nyuuzyou/anime-and-art-677ae996682a389fccd892c3

Collections should become a more common feature - hopefully this will encourage others to create and share their own curated collections. By organizing related datasets into these themed collections, I hope to make it easier for researchers and developers to discover and use these valuable resources.