OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7 • 111
Generating a Low-code Complete Workflow via Task Decomposition and RAG Paper • 2412.00239 • Published 24 days ago • 4
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation Paper • 2312.11532 • Published Dec 15, 2023 • 5
LLM-Assisted Code Cleaning For Training Accurate Code Generators Paper • 2311.14904 • Published Nov 25, 2023 • 4
CodeCoT and Beyond: Learning to Program and Test like a Developer Paper • 2308.08784 • Published Aug 17, 2023 • 5
HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting Paper • 2312.03461 • Published Dec 6, 2023 • 15
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing Paper • 2310.13855 • Published Oct 20, 2023 • 1
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization Paper • 2310.16427 • Published Oct 25, 2023 • 1
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs Paper • 2311.13600 • Published Nov 22, 2023 • 42
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning Paper • 2310.16049 • Published Oct 24, 2023 • 4
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26, 2023 • 70
The ART of LLM Refinement: Ask, Refine, and Trust Paper • 2311.07961 • Published Nov 14, 2023 • 10
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster Paper • 2311.08263 • Published Nov 14, 2023 • 15
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure Paper • 2311.07590 • Published Nov 9, 2023 • 16
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 23
Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 16