🤖 Controlling Computers with Small Models 🤖
We just released PTA-1, a fine-tuned Florence-2 model for localizing GUI text and elements. Inference takes ~150 ms on an RTX 4080, which means you can now start building fast, on-device computer-use agents!
Model: AskUI/PTA-1
Demo: AskUI/PTA-1
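To give a feel for how a localization result could feed an agent, here is a minimal sketch that turns a predicted bounding box into a click target. It does not call PTA-1 itself. The `Box` type, the `click_point` helper, and the pixel-coordinate `(x1, y1, x2, y2)` box format are all assumptions for illustration, so check the model card for the actual output format.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box in pixel coordinates (assumed format)."""
    x1: float
    y1: float
    x2: float
    y2: float


def click_point(box: Box, width: int, height: int) -> tuple[int, int]:
    """Return the centre of `box`, clamped to the screenshot bounds.

    Hypothetical helper: an agent would pass the box predicted for a
    GUI element and click at the returned (x, y) position.
    """
    cx = (box.x1 + box.x2) / 2
    cy = (box.y1 + box.y2) / 2
    # Truncate to whole pixels, then keep the point inside the image.
    x = min(max(int(cx), 0), width - 1)
    y = min(max(int(cy), 0), height - 1)
    return x, y


if __name__ == "__main__":
    # Made-up box for a button on a 1920x1080 screenshot.
    button = Box(x1=100, y1=200, x2=300, y2=260)
    print(click_point(button, width=1920, height=1080))
```

Clicking the box centre is just one simple choice. A real agent might also want to skip boxes that are implausibly small or that fall mostly outside the screenshot.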