Your AI Operating System — Voice, Vision, and Total Control of Your Digital Life
CortexOS is a personal AI operating system that acts as your intelligent assistant across every communication channel. Unlike simple chatbots, CortexOS can see, hear, speak, remember, and take action — managing your email, calendar, files, messages, and more.
"Read my latest emails and reply to the one from Marco"
"Schedule a meeting with the team for tomorrow at 3 PM"
"What do you see in front of me?" — using your live camera
"Research the latest AI news and summarize the top 5 articles"
"Send an invoice for $500 to the client"
"Remember that I prefer meetings in the afternoon"
Talk to CortexOS however you want — by text, voice, video, or even a phone call. Every channel has access to the same brain, the same tools, and the same memory.
Browser-based chat at /chat
Full bot with photo & voice support
Via Evolution API — text and media
Twilio SIP — natural voice conversation
WebRTC real-time voice via Daily
Live camera + voice — CortexOS can see
Over 25 built-in tools, plus the ability to create new ones on the fly. CortexOS doesn't just talk — it takes action.
Read, search, and send emails. Filters by primary inbox by default. Supports full Gmail search queries — from, to, labels, unread, attachments.
list_emails · read_email · send_emailView upcoming events, create meetings with timezone support, delete events. Multi-calendar support with custom date ranges.
list_events · create_event · delete_eventFull Google Tasks CRUD — create, list, complete, and delete tasks. Supports multiple task lists and due dates.
list_tasks · create_task · complete_task · delete_taskCreate, read, update, search, and list files on Google Drive. Save reports, notes, documents directly from conversation.
drive_create · drive_read · drive_update · drive_listSave new contacts with full details (name, email, phone, company, title) and search existing contacts by any field.
save_contact · search_contactsTake a photo of a business card — CortexOS reads it with vision AI and automatically saves the contact to Google Contacts.
scan_business_cardSend WhatsApp messages to any phone number via the Evolution API integration. International format, text messages.
send_whatsappSend professional SMS text messages worldwide. Customizable sender ID, international phone format, 480 character limit.
send_smsReal-time internet search via DuckDuckGo + Tavily. Research topics, check news, find factual data that CortexOS doesn't have locally.
web_search · tavily_search · tavily_researchFull headless browser: navigate, click, type, fill forms, take screenshots, extract text and links. Multi-step web interactions.
browse_web · browser_actionSemantic long-term memory powered by ChromaDB. Stores and recalls preferences, facts, and conversation history across sessions. Per-user isolation.
store_memory · search_archival_memoryCurrent time awareness for scheduling, timezone conversions, and time-sensitive operations.
get_current_timeCreate invoices, process payments, manage customers via Square POS integration. Supports sandbox and production environments.
square_billingCortexOS can modify its own instructions, daily briefing routines, and user profiles at runtime. Self-improving system.
manage_configCortexOS can create new tools on its own. Need a tool that doesn't exist? Just describe it — CortexOS writes the Python code, tests it in a sandbox, registers it, and makes it available instantly. With automatic rollback if anything fails.
create_tool · delete_toolModel Context Protocol (MCP) server support for extending capabilities. Currently connected to Tavily for advanced web research with search, extract, crawl, and map tools.
tavily_search · tavily_extract · tavily_crawl · tavily_mapCortexOS uses a "Think → Act → Observe" loop to solve problems step by step, just like a human assistant would.
list_events for today's
date.search_contacts for
"Marco" to find his email.send_email with a
formatted summary.For developers and technical users — how CortexOS is built under the hood.
py_compileThe most advanced feature — real-time video and voice conversation with AI that can see your camera.
capture_participant_video() starts decoding their camera
framesInputImageRawFrame objects (1 FPS)send_realtime_input(video=Blob)CortexOS/ ├── app/ │ ├── main.py ← FastAPI entry point, routing │ ├── agent_core.py ← ReAct brain (1440 lines) │ ├── llm_router.py ← Multi-model failover │ ├── config_manager.py ← Pydantic-Settings config │ ├── telegram_bot.py ← Telegram webhook handler │ ├── whatsapp_handler.py ← Evolution API handler │ ├── voice_agent.py ← Pipecat voice pipeline │ ├── vision_agent.py ← Pipecat vision+voice pipeline │ ├── mcp_client.py ← MCP server connector │ ├── auth/ │ │ ├── oauth_handler.py ← Google OAuth2 flow │ │ └── token_manager.py ← Token refresh & persistence │ ├── db/ │ │ ├── sqlite_manager.py← SQLite schema & queries │ │ └── chroma_manager.py← ChromaDB vector store │ └── tools/ │ ├── google_gmail.py ← Gmail API integration │ ├── google_calendar.py │ ├── google_tasks.py │ ├── google_drive.py │ ├── web_search.py ← DuckDuckGo search │ ├── browser_tool.py ← Playwright headless browser │ ├── square_billing.py │ ├── clicksend_sms.py │ ├── business_card_scanner.py │ ├── tool_creator.py ← Dynamic tool generation │ └── dynamic_tools/ ← Auto-generated tools ├── config/ │ ├── .env ← API keys (Setup Wizard) │ ├── core_instructions.md ← System prompt │ └── available_tools.json ← Tool registry ├── templates/ ← Jinja2 HTML templates ├── static/ ← CSS, JS, icons ├── scripts/ │ ├── backup.sh ← Daily backup → Google Drive │ └── gdrive_upload.py ← Drive upload utility ├── docker-compose.prod.yml ├── Dockerfile └── requirements.txt