Blog, Articles, and discussions
Community Articles
view all
Community Articles
view allThoughts on LoRA Training Pt 2: Where to Train
By
•
•
2Thoughts on LoRA Training #1
By
•
•
12MobileNet-V4 (now in timm)
By
•
•
26Against mixing environment setup with code
By
•
SwanLab and Transformers: Power Up Your NLP Experiments
By
•
•
5Fine-tuning Mistral on Your Dataset
By
•
•
4Market Research using AI Evolutionary Algorithms and Multimodal Regression
By
•
•
1CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)
By
•
•
4Train a Shitty Tic-Tac-Toe AI
By
•
The CVPR Survival Guide: Discovering Research That's Interesting to YOU!
By
•
•
8Uncensor any LLM with abliteration
By
•
•
218Low Latency CPU Based Educational Value Classifier With Generic Educational Value
By
•
•
5An Optimal Lossy Variant of Speculative Decoding
By
•
•
1Reports on the Hub: A First Look at Self-governance in Open Source AI Development
By
•
•
5Building a Vision Mixture-of-Expert Model from several fine-tuned Phi-3-Vision Models
By
•
•
2Running Large Multimodal Models on an AI PC's NPU
By
•
•
4Introduction to State Space Models (SSM)
By
•
•
40Saving Memory Using Padding-Free Transformer Layers during Finetuning
By
•
•
6An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct
By
•
•
35Aligning Large Language Models with BRAIn
By
•
•
8