Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28 • 95
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters Paper • 2406.16758 • Published Jun 24 • 19
Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts Paper • 2404.09221 • Published Apr 14 • 1
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters Paper • 2406.16758 • Published Jun 24 • 19
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters Paper • 2406.16758 • Published Jun 24 • 19 • 3
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4 • 37
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4 • 37
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper • 2405.15574 • Published May 24 • 53
Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts Paper • 2404.09221 • Published Apr 14 • 1
Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions Paper • 2311.00233 • Published Nov 1, 2023 • 4
Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Approach for Object Detection Paper • 2310.17097 • Published Oct 26, 2023 • 3
Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions Paper • 2311.00233 • Published Nov 1, 2023 • 4
Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Approach for Object Detection Paper • 2310.17097 • Published Oct 26, 2023 • 3