April 9 - Visual AI Agents Workshop
Join us on April 9 at 9 AM Pacific for the Visual Agents: What it Takes to Build an Agent that can Navigate GUIs like Humans virtual workshop. Register for the Zoom This hands-on workshop provides ...

Source: DEV Community
Join us on April 9 at 9 AM Pacific for the Visual Agents: What it Takes to Build an Agent that can Navigate GUIs like Humans virtual workshop. Register for the Zoom This hands-on workshop provides a comprehensive introduction to building and evaluating visual agents for GUI automation using modern tools and techniques. Participants will learn how to leverage FiftyOne, an open-source toolkit for dataset curation and computer vision workflows, to build production-ready GUI agent systems. What You'll Learn: Dataset Creation & Management: How to structure, annotate, and load GUI interaction datasets using the COCO4GUI standardized format Data Exploration & Analysis: Using FiftyOne's interactive interface to visualize datasets, analyze action distributions, and understand annotation patterns Multimodal Embeddings: Computing embeddings for screenshots and UI element patches to enable similarity search and retrieval Model Inference: Running state-of-the-art models like Microsoft's GUI