Vision Possible: Agent Protocol

Your mission, should you choose to accept it: Build multi-modal AI agents that watch, listen, and understand video in real-time.

23 Feb - 1 Mar

Mission Rewards

$4,000+

+ exclusive agent swag

+ interview at WeMakeDevs

Participant testimonials

Recently, I wasn't developing any projects due to personal issues & depression. The #VisionPossible hackathon by @WeMakeDevs reignited my spark! Huge thanks to @kunalstwt for introducing me to @visionagents_ai — it got me building again.

Vicky, on X

Built IntentLens for the Vision Possible Hackathon — a real-time Vision AI agent that tracks people, detects gestures, analyzes movement, and answers questions about what it sees. Learned a lot building this one. Thanks @kunalstwt, @WeMakeDevs & @visionagents_ai!

Rishi, on X

A huge thanks to @WeMakeDevs, @visionagents_ai and @kunalstwt for conducting this amazing hackathon. Got to learn a lot about building vision agents! I even published a detailed blog regarding the project!

Swayam, on X

Built for @WeMakeDevs Vision Possible: Agent Protocol Hackathon. First hackathon I've ever submitted. Felt like the right one to finally finish. Full build story + architecture + code.

Yamini, on X

Built Drishti AI, a real-time eye screening agent that runs on any ASHA worker's Android phone. India has 12 million blind people, but 80% of this blindness is preventable. The Vision Agents SDK made building a real-time vision AI agent feel achievable in 7 days.

Mehboob, on X

Submitted my FIRST hackathon project yesterday. A week ago, I didn't think I could. Built VisionMate AI at the Vision Possible hackathon by @WeMakeDevs. Biggest lesson? Don't wait to feel ready. Start. 1 week of building > months of tutorials.

Swagatika, on X

Building my Vision Possible hackathon entry: Live Face Match! Taking my production Pixel Match AI (facial recognition for event photographers) and adding @visionagents_ai realtime video processing. Point camera at attendees → instant live galleries appear.

Tosif, on X

Mission Briefing

Your mission, should you choose to accept it: Build multi-modal AI agents that watch, listen, and understand video in real-time. Vision Agents gives you the building blocks to create intelligent, low-latency video experiences powered by your models, your infrastructure, and your use cases. Whether you're building security systems, sports coaching AI, interactive gaming, or something we haven't imagined yet - this hackathon is your proving ground. This message will self-destruct... after you build something amazing.

Your Mission Objectives

Build multi-modal AI agents that watch, listen, and understand video in real-time

In the world of AI, video remains the final frontier. Static image analysis is yesterday's mission. Real-time video understanding is the protocol for the future.

Whether you're building security systems, sports coaching AI, drone detection, or interactive gaming experiences - this hackathon is your chance to push the boundaries of what's possible with real-time Vision AI.

Join latency: 500ms
Audio/video latency: <30ms

Video AI

Real-time video intelligence

Combine YOLO, Roboflow, Moondream, and other vision models with Gemini/OpenAI in real-time. Build agents that truly see and understand.

Ultra-Low Latency

Stream's edge network

Join in 500ms and keep audio/video latency under 30ms. Your agents respond in real time, not real-later.

Native LLM APIs

Direct access to the latest models

Native SDK methods from OpenAI, Gemini, and Claude. Always access the latest LLM capabilities without waiting for wrapper updates.

Cross-Platform SDKs

Build anywhere

SDKs for React, Android, iOS, Flutter, React Native, and Unity. Your vision agents can run on any platform.

Mission Rewards

Complete your mission and earn elite status with substantial rewards

Elite Agent Rewards

Alpha Protocol
$2,000

1st Place

Bravo Protocol🎯
$1,500

2nd Place

Best Blog Submission📝
$500

Share your experience using the Vision Agents SDK in a blog post

+ exclusive agent swag
+ interview at WeMakeDevs

Intel Network Rewards

Join the network! Star the Vision Agents repository on GitHub and share your mission progress on social media (tag @VisionAgents). The top 10 intel reports win swag bundles.

Top 10 Posts Win Swag

Career Opportunities

Outstanding agents may be recruited for positions at WeMakeDevs. Showcase your vision AI skills and join the team building the future of real-time video!

Join the WeMakeDevs Team

Mission Evaluation Criteria

Potential Impact

How effectively does the project address a meaningful problem or unlock a valuable use case in the Vision AI space?

Creativity & Innovation

How unique is the idea? Does it push the boundaries of what's possible with real-time video AI?

Technical Excellence

How well is the project implemented? Does it demonstrate mastery of the Vision Agents SDK and related technologies?

Real-Time Performance

Does the agent truly operate in real time? Is it as responsive and low-latency as Vision Agents allows?

User Experience

Is the agent intuitive to interact with? Does it provide a seamless, polished experience?

Best Use of Vision Agents

How effectively does the project leverage Vision Agents' capabilities - video AI, low latency, native APIs, and multi-platform support?

Frequently Asked Questions