Voice AI · Built at YC × Moss Conversational AI Hackathon
ProbeIQ
Transforming Educational Videos into Interactive AI Tutors
ProbeIQ is a multimodal AI learning system that converts educational videos into conversational tutors. Instead of passively watching lectures, learners can ask questions naturally and receive grounded answers, explanations, and guidance based on the original source material.

Role
AI Product Design · AI Engineering · Conversation Design
Timeline
2 days
Built At
YC × Moss Conversational AI Hackathon
Focus
Multimodal AI · Conversational UX · Retrieval · Voice Interfaces
Outcome
Working conversational prototype · Video understanding · Grounded responses · Real-time voice interaction
Overview
Educational videos contain enormous amounts of expert knowledge, but interacting with them is difficult.
Learners often rewind videos, search transcripts, or leave the lesson entirely when they become confused.
ProbeIQ explores a different interaction model: transforming passive educational media into an AI tutor that understands the source material and answers questions naturally in real time.
The problem
Students frequently struggle to:
• Find the exact explanation they need • Navigate long instructional videos • Connect concepts across lessons • Receive clarification while learning • Stay engaged without leaving the lesson
The problem isn't a lack of educational content.
The problem is that existing content is difficult to search, navigate, and interact with.
From video to conversation
- Educational Video
- Transcription
- Knowledge Structuring
- Semantic Retrieval
- AI Tutor
- Grounded Answer + Timestamp
Rather than generating generic AI responses, ProbeIQ grounds every answer in the original instructional content. Learners receive contextual explanations tied directly to the lesson being watched.
My contributions
I designed both the product experience and the AI workflow behind ProbeIQ. I also designed how learners transition naturally between watching content, asking questions, retrieving context, and resuming playback.
- Product strategy
- AI interaction design
- Conversation design
- Retrieval workflow design
- Information architecture
- AI UX
- Frontend implementation
- Demo experience
Product decisions
Grounded AI
Instead of hallucinating, every response references the original lesson.
Conversation over Search
Learners ask natural questions rather than searching transcripts manually.
Context Preservation
The tutor maintains awareness of where the learner is within the lesson.
Invisible AI
The technology supports the learning experience without overwhelming the interface.
Demo
The prototype demonstrates an educational video becoming an interactive AI tutor capable of answering questions while preserving context from the lesson.
Prototype demonstrated during the YC × Moss Conversational AI Hackathon.
At the hackathon


How it works
1
Educational video is transcribed into structured learning data.
2
Knowledge is segmented into searchable concepts and timestamps.
3
Semantic retrieval finds relevant instructional context.
4
The conversational AI generates grounded answers while allowing learners to continue watching without interruption.
System architecture
- Video
- Transcription
- Knowledge Structuring
- Retrieval
- AI Tutor
- Grounded Answer + Timestamp
Technology
- Voice AI
- Retrieval-Augmented Generation (RAG)
- Semantic Search
- Multimodal AI
- Real-time Voice Interaction
- Conversation Memory
- Moss
- MiniMax
- Deepgram
- LiveKit
- Qwen
Challenges
- Designing an interface that keeps learners focused on the lesson
- Grounding AI responses in source material instead of generating generic answers
- Creating a conversational workflow without interrupting video playback
- Balancing retrieval accuracy with real-time responsiveness
- Building an end-to-end prototype within a two-day hackathon
Lessons learned
The biggest challenge wasn't building another chatbot. It was designing an AI interaction that felt like learning instead of searching.
Key takeaways
- Retrieval quality determines user trust.
- Conversation must preserve learning context.
- AI works best when grounded in source material.
- The best educational AI feels like an expert sitting beside the learner.
Why this project matters
Educational AI should augment expert knowledge rather than replace it.
ProbeIQ demonstrates how multimodal AI, retrieval, and conversational interfaces can transform static educational content into interactive learning experiences.
Although this prototype focused on educational videos, the same architecture could support enterprise documentation, onboarding, technical training, healthcare education, and professional certification.