
What are 50,000 of your viewers actually asking?

Creators publish content, read a handful of comments, and move on. But buried in thousands of comments is some of the most valuable audience research available anywhere — questions people keep asking, frustrations that keep surfacing, topics they're desperate for more of.

What you get

The pipeline ingests large volumes of YouTube comments, generates a semantic embedding for each one, clusters the comments by meaning, and runs an insight agent over the clusters to surface actionable findings.

Demo output coming soon — this project is currently In Progress.

Want something like this for your channel?

Let's talk →

Creator Comment Intelligence Agent

Tier 2 · In Progress
Embeddings · Clustering · Insight Extraction

Architecture

Data Ingestion: YouTube API → Comment Ingestion → Preprocessing
DigitalOcean: Embedding Generation → Clustering (HDBSCAN) → Cluster Labeling → Insight Extraction
Vercel: Dashboard
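
The hand-off from clustering to cluster labeling can be illustrated with a small sketch. It assumes the labeling agent is given a few comments that sit closest to each cluster's centroid, which is one common approach and not necessarily how this project does it. The function names `cosine` and `representatives` and the `top_k` parameter are hypothetical. The only library behavior relied on is that HDBSCAN marks noise points with the label -1.

```python
import math
from collections import defaultdict


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def representatives(comments, vectors, labels, top_k=3):
    """Map each cluster label to the top_k comments nearest its centroid."""
    members = defaultdict(list)
    for index, label in enumerate(labels):
        if label != -1:  # -1 marks noise in HDBSCAN output; skip it
            members[label].append(index)

    result = {}
    for label, indices in members.items():
        dims = len(vectors[indices[0]])
        # Centroid = per-dimension mean of the cluster's embedding vectors.
        centroid = [
            sum(vectors[i][d] for i in indices) / len(indices) for d in range(dims)
        ]
        ranked = sorted(
            indices, key=lambda i: cosine(vectors[i], centroid), reverse=True
        )
        result[label] = [comments[i] for i in ranked[:top_k]]
    return result
```

Passing only a handful of representative comments per cluster keeps the labeling prompt short, at the cost of possibly missing minority themes inside a large cluster.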

Tech Stack

Layer            Technology
Data source      YouTube Data API v3
Embeddings       OpenAI text-embedding-3-small
Clustering       HDBSCAN / scikit-learn
Orchestration    Python + asyncio
Insight agent    OpenAI GPT-4o + Claude API
Output           Next.js dashboard
Infrastructure   DigitalOcean App Platform
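
As a rough illustration of the "Python + asyncio" orchestration layer, the sketch below embeds comments in batches while capping the number of requests in flight. It is a minimal sketch, not the project's actual code: `embed_all`, `batched`, `batch_size`, and `max_concurrency` are hypothetical names, and `fake_embed_batch` is a stand-in that returns toy vectors where a real embeddings call would go.

```python
import asyncio
from typing import Awaitable, Callable, Sequence

EmbedFn = Callable[[Sequence[str]], Awaitable[list[list[float]]]]


def batched(items: Sequence[str], size: int) -> list[Sequence[str]]:
    """Split items into consecutive chunks of at most `size` elements."""
    return [items[i:i + size] for i in range(0, len(items), size)]


async def embed_all(
    comments: Sequence[str],
    embed_batch: EmbedFn,
    batch_size: int = 100,
    max_concurrency: int = 4,
) -> list[list[float]]:
    """Embed every comment, running at most `max_concurrency` batches at once."""
    limiter = asyncio.Semaphore(max_concurrency)

    async def run(batch: Sequence[str]) -> list[list[float]]:
        async with limiter:
            return await embed_batch(batch)

    # gather() returns results in input order, so vectors line up with comments.
    results = await asyncio.gather(*(run(b) for b in batched(comments, batch_size)))
    return [vector for batch in results for vector in batch]


async def fake_embed_batch(batch: Sequence[str]) -> list[list[float]]:
    """Hypothetical placeholder for a real embeddings request; toy 2-d vectors."""
    await asyncio.sleep(0)
    return [[float(len(text)), float(text.count("?"))] for text in batch]


if __name__ == "__main__":
    print(asyncio.run(embed_all(["hi", "why?", "ok then"], fake_embed_batch, batch_size=2)))
```

The semaphore is the piece that matters here: it lets many batches be scheduled up front while keeping the number of concurrent calls under whatever rate limit the embedding provider enforces.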

Pipeline

1. YouTube API
2. Comment Ingestion
3. Preprocessing & Deduplication
4. Embedding Generation
5. Clustering (HDBSCAN)
6. Cluster Labeling Agent
7. Insight Extraction Agent
8. Dashboard Output
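
Step 3 is the cheapest place to cut volume before paying for embeddings. The following is one possible sketch of preprocessing and deduplication, assuming exact-match dedup on a normalized form of each comment. The helper names `normalize` and `dedupe`, the URL-stripping rule, and the `min_chars` threshold are illustrative assumptions rather than details taken from the project.

```python
import re
import unicodedata

URL_RE = re.compile(r"https?://\S+")
WS_RE = re.compile(r"\s+")


def normalize(text: str) -> str:
    """Return a canonical form of a comment, used only for duplicate detection."""
    text = unicodedata.normalize("NFKC", text)  # unify visually equivalent characters
    text = URL_RE.sub(" ", text)                # ignore links when comparing
    text = WS_RE.sub(" ", text).strip()         # collapse runs of whitespace
    return text.casefold()                      # case-insensitive comparison


def dedupe(comments: list[str], min_chars: int = 3) -> list[str]:
    """Keep the first occurrence of each distinct comment, preserving order."""
    seen: set[str] = set()
    kept: list[str] = []
    for raw in comments:
        key = normalize(raw)
        # Skip near-empty comments and anything already seen in normalized form.
        if len(key) < min_chars or key in seen:
            continue
        seen.add(key)
        kept.append(raw.strip())
    return kept
```

Note that the original text is what gets kept and embedded; the normalized form serves only as the comparison key, so casing and links are still available to later stages.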