Approximating Two-Layer Feedforward Networks for Efficient Transformers Paper • 2310.10837 • Published Oct 16, 2023 • 10
Skywork: A More Open Bilingual Foundation Model Paper • 2310.19341 • Published Oct 30, 2023 • 5