bytedance/UI-TARS-desktop
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
GitHub repository with 35,902 stars and 3,611 forks.
Language: TypeScript
Topics: agent, vlm, vision, computer-use, mcp, mcp-server, gui-operator, browser-use, gui-agent, multimodal