L^2M: Mutual Information Scaling Law for Long-Context Language Modeling Paper • 2503.04725 • Published 6 days ago • 19
TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision Paper • 2404.10771 • Published Apr 16, 2024 • 1
ANTN: Bridging Autoregressive Neural Networks and Tensor Networks for Quantum Many-Body Simulation Paper • 2304.01996 • Published Apr 4, 2023 • 1
QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation Paper • 2406.00132 • Published May 31, 2024 • 6