Nguyễn Hoàng Long's picture
1

Nguyễn Hoàng Long

oggyfaker

AI & ML interests

Computer Vision

Recent Activity

Organizations

None yet

oggyfaker's activity

reacted to Jaward's post with ❤️ 26 days ago
view post
Post
3001
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
upvoted an article 5 months ago