Krum Arnaudov

krumeto
ยท

AI & ML interests

None yet

Recent Activity

reacted to singhsidhukuldeep's post with ๐Ÿ‘ 28 days ago
Exciting breakthrough in AI Recommendation Systems! Just read a fascinating paper from Meta AI and UW-Madison researchers on unifying generative and dense retrieval methods for recommendations. The team introduced LIGER (LeveragIng dense retrieval for GEnerative Retrieval), a novel hybrid approach that combines the best of both worlds: Key Technical Innovations: - Integrates semantic ID-based generative retrieval with dense embedding methods - Uses a T5 encoder-decoder architecture with 6 layers, 6 attention heads, and 128-dim embeddings - Processes item attributes through sentence-T5-XXL for text representations - Employs a dual-objective training approach combining cosine similarity and next-token prediction - Implements beam search with size K for candidate generation - Features an RQ-VAE with 3-layer MLP for semantic ID generation Performance Highlights: - Significantly outperforms traditional methods on cold-start recommendations - Achieves state-of-the-art results on major benchmark datasets (Amazon Beauty, Sports, Toys, Steam) - Reduces computational complexity from O(N) to O(tK) where t is semantic ID count - Maintains minimal storage requirements while improving recommendation quality The most impressive part? LIGER effectively solves the cold-start problem that has long plagued recommendation systems while maintaining computational efficiency. This could be a game-changer for e-commerce platforms and content recommendation systems! What are your thoughts on hybrid recommendation approaches?
liked a model about 2 months ago
INSAIT-Institute/BgGPT-Gemma-2-27B-IT-v1.0-GGUF
View all activity

Organizations

None yet

krumeto's activity

New activity in jinaai/jina-reranker-v1-turbo-en 9 months ago
New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 about 1 year ago
New activity in Intel/neural-chat-7b-v3-1 about 1 year ago

Context Length

2
#7 opened about 1 year ago by
mrfakename
New activity in Intel/neural-chat-7b-v3 about 1 year ago
New activity in TheBloke/Llama-2-70B-Chat-AWQ about 1 year ago

Performance and latency vs. GPTQ

1
#3 opened about 1 year ago by
krumeto