Spaces:
Running
title: Meeting Agent Docker
emoji: ๐ณ
colorFrom: yellow
colorTo: indigo
sdk: docker
pinned: false
short_description: Docker build Gradio for the Meeting Intelligence Agent.
๐๏ธ Meeting Intelligence Agent
AI-powered assistant for meeting transcription, analysis, and management.
This project implements an intelligent conversational agent that orchestrates the entire meeting intelligence workflow. It allows users to upload video recordings, automatically transcribe them with speaker identification, edit transcripts, store them in a vector database (Pinecone), and perform advanced RAG (Retrieval-Augmented Generation) queries to extract insights, summaries, and action items.
Complete Project Repo in GitHub: https://github.com/GFiaMon/meeting-intelligence-agent
๐ Table of Contents
- ๐๏ธ Meeting Intelligence Agent
- ๐ Table of Contents
- ๐ Features
- ๐ Documentation
- ๐๏ธ System Architecture
- ๐ Quick Start
- ๐ณ Docker Support
- ๐ Live Demo & Deployment
- ๐ Project Structure
- ๐ Monitoring & Evaluation
- ๐ MCP Integration Details
- ๐ฎ Future Enhancements
- ๐ค Contributing
- ๐ License
- ๐ Acknowledgments
- ๐ง Contact
๐ Features
๐ฃ๏ธ Natural Language Interface: Control everything through a chat-based agent using LangGraph.
๐ Local/Cloud Deployment: Docker + Hugging Face Spaces.
๐น Video Analysis Pipeline:
- Upload MP4/MOV/AVI files directly.
- WhisperX Transcription: High-accuracy speech-to-text.
- Speaker Diarization: Automatically distinguishes between different speakers.
- Smart Speaker Mapping: LLM intelligently assigns real names to speaker labels (e.g., "Speaker_01" โ "Alice") from context.
โ๏ธ Interactive Editor: Review and correct transcripts before commiting them to the database.
๐ง Semantic Search (RAG):
- Stores meetings in Pinecone vector database.
- Intelligent metadata extraction (Titles, Dates, Summaries) using GPT-4o-mini.
- Time-Aware Queries: Understands relative time (e.g., "What did we discuss 2 weeks ago?") using a dedicated Time MCP server.
- Ask questions like "What did we decide about the budget?" or "List all action items for John".
๐ MCP Integration (Model Context Protocol):
- Connects to external tools like Notion to export meeting minutes directly.
- Custom Time Server: World time queries for relative date calculations.
- Zoom Integration: (Future) Real-time meeting capture via RTMS API.
๐ LangSmith Integration: Full tracing and monitoring of agent workflows.
๐ Documentation
Additional Documentation Files
For detailed technical documentation, see: - ARCHITECTURE.md - Full system design - TECHNICAL_IMPLEMENTATION.md - Complete tool reference and Mermaid diagrams - DEPLOYMENT_GUIDE.md - Step-by-step deployment guide - Pinecone Management Script - Utility for database management
๐๏ธ System Architecture
+--------+ +------------------+
| User | <---> | Gradio Interface |
+--------+ +------------------+
^
|
v
+-------------------------------------------------------+
| Conversational Agent (LangGraph) |
| +----------------+ |
| | OpenAI GPT-4 | |
| +----------------+ |
+---------------------------+---------------------------+
|
+-----------------+-----------------+
| | |
v v v
+------------------+ +-------------+ +------------------+
| Tools & Caps | | Video Pipe | | Ext. Integrations|
| - Video Process | | - Upload | | - Notion API |
| - Queries | | - WhisperX | | - Zoom API |
| - External MCP | | - Diarize | | - Time API |
| | | - Editor | | |
+------------------+ +-------------+ +------------------+
| |
+--------+--------+
|
v
+------------------+
| Data Storage |
| (Pinecone DB) |
+------------------+
๐ Quick Start
Prerequisites
- Python 3.11
- FFmpeg (required for audio processing)
- Node.js & npm (optional, required if using Notion MCP integration)
- Pinecone Account
- OpenAI API Key
Installation
Clone the repository:
git clone https://github.com/yourusername/meeting-agent.git cd meeting-agentInstall dependencies:
pip install -r requirements.txtConfigure Environment: Create a
.envfile in the root directory:OPENAI_API_KEY=your_openai_key PINECONE_API_KEY=your_pinecone_key PINECONE_INDEX=your_index_name PINECONE_ENVIRONMENT=us-east-1 # Optional: For Notion MCP ENABLE_MCP=true NOTION_TOKEN=your_notion_key # LangSmith (optional) LANGSMITH_API_KEY=your_langsmith_key LANGSMITH_PROJECT=meeting-agentRun the Application:
python app.pyAccess the UI at
http://localhost:7860.
Pinecone Management
Manage your vector database with the included utility:
# List all meetings
python scripts/manage_pinecone.py list
# View statistics
python scripts/manage_pinecone.py stats
# Delete specific meeting
python scripts/manage_pinecone.py delete meeting_abc12345
๐ณ Docker Support
Build and run the application in a container.
Build the image:
docker build -t meeting-agent .โ ๏ธ IMPORTANT FOR HUGGING FACE SPACES:
Standard Gradio deployment may fail due to specific dependency conflicts (WhisperX/Pyannote).
You must use Docker for deployment.
Userequirements_hf.txt(rename it torequirements.txtinside your deployment repo) which contains safe, Linux-compatible version ranges. The standardrequirements.txtis optimized for local Mac/Dev environments.Run the container:
docker run -p 7860:7860 --env-file .env meeting-agent
๐ Live Demo & Deployment
Hugging Face Spaces
- Live Demo: https://huggingface.co/spaces/GFiaMon/meeting-agent-docker
- One-Click Clone: Click "Duplicate Space" on the Hugging Face page to deploy your own instance
- Auto-Deploy: Cloned spaces automatically build as Docker containers with all dependencies
Custom MCP Servers
โฐ World Time Server: A custom MCP server for timezone-aware queries deployed at https://huggingface.co/spaces/GFiaMon/date_time_mpc_server_tool (Can be cloned or connected as an external MCP server to an AI agent)
๐ฅ Zoom RTMS Integration: In development (
external_mcp_servers/zoom_mcp/), working with Zoom's API team
๐ Project Structure
meeting-agent/
โโโ app.py # ๐ Entry point (Gradio App)
โโโ src/
โ โโโ agents/ # LangGraph Agent definition
โ โโโ config/ # Configuration & Settings
โ โโโ processing/ # Audio/Video processing pipelines
โ โโโ retrievers/ # Pinecone & RAG logic
โ โโโ tools/ # Tool definitions (Video, General, MCP)
โ โโโ ui/ # Gradio UI components
โโโ documentation/ # Technical Documentation
โโโ scripts/ # Helper scripts
โโโ requirements.txt # Dependencies
๐ Monitoring & Evaluation
LangSmith Integration
The agent integrates with LangSmith for comprehensive tracing and monitoring:
- Prompt/Response Tracking: All agent interactions are logged
- Tool Usage: Complete tool execution history
- Performance Metrics: Latency and token usage tracking
- Debugging: Easy identification of issues in complex workflows
Basic Evaluation
While full quantitative metrics are a future enhancement, the system includes:
- Functional Testing: All tools tested end-to-end
- Integration Testing: Pinecone, Notion, and MCP connections verified
๐ MCP Integration Details
Current MCP Servers
- Notion MCP: Official
@notionhq/notion-mcp-serverfor Notion API access - World Time Server: Custom Gradio-based MCP server for timezone-aware queries
- Zoom RTMS: Prototype for future Zoom integration (in development)
Time-Aware Queries Example
The custom Time MCP server enables queries like:
- "What did we discuss last Tuesday?"
- "Show me meetings from 2 weeks ago"
- "What action items were assigned in December?"
The server calculates relative dates and provides timezone-aware timestamps for accurate meeting retrieval.
๐ฎ Future Enhancements
In Development
- Zoom RTMS API Integration: Real-time meeting transcription and capture
- Enhanced Metrics: Quantitative evaluation with LangSmith
- Batch Processing: Handle multiple meetings simultaneously
Planned Features
- Multi-language Support: Transcription in 10+ languages
- Advanced Analytics: Sentiment analysis, speaker analytics
- Export Formats: PDF, Google Docs, etc
๐ค Contributing
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature) - Commit your Changes (
git commit -m 'Add some AmazingFeature') - Push to the Branch (
git push origin feature/AmazingFeature) - Open a Pull Request
๐ License
Distributed under the MIT License. See LICENSE for more information.
๐ Acknowledgments
- Ironhack Data Science & AI Program - Course framework and guidance
- OpenAI - Whisper and GPT models
- WhisperX - Audio/Video processing
- Pinecone - Vector database
- Notion - Notion API access
- LangChain - Agent framework and tools
- Hugging Face - Deployment platform and community
๐ง Contact
Author: Guillermo Fiallo Montero - Data Science & AI Engineer
Project Link: https://github.com/GFiaMon/meeting-intelligence-agent
Capstone Project - Ironhack Data Science & AI Program - December 2025