I need a developer to build a browser-based application that enables natural, real-time, two-way voice conversations between a user and an AI model. The application should be lightweight, embeddable into an existing website, and have the following capabilities:
Core Features: 1. Live Two-Way Audio • User clicks a “Start Conversation” button to initiate a voice session. • AI and user can speak back and forth with minimal latency. • AI can respond while the user is speaking (interruptions allowed).
2. Streaming Speech-to-Text & Text-to-Speech • User’s voice is continuously transcribed in real time. • AI responses are converted to natural-sounding speech and played instantly.
3. Custom AI Model & Knowledge Base • The AI will be fine-tuned or configured to follow a specific role/persona and to operate within a defined knowledge base. • This role/persona will be predefined but should be easy to adjust in the code.
4. User-Uploaded Documents for Context • Users can upload documents (PDF, Word, or text) before or during the session. • The AI should be able to reference and incorporate this content dynamically into the conversation. • Uploaded files should be processed so the AI can search and quote relevant sections when replying.
5. Embeddable UI • Simple, single-screen interface with: • “Start Conversation” / “End Conversation” controls • Optional transcript display of the conversation • File upload area for session-specific documents • Minimal styling but easy to theme for my brand.
6. Optional Data Passing • Ability to load additional text/data into the AI’s context before the session starts.
Technical Requirements: • Must use OpenAI’s Realtime API (or similar low-latency LLM service) for voice streaming. • Browser-based microphone access via WebRTC (or equivalent). • Clean, well-commented HTML/JavaScript (or minimal React) that can be embedded into a Wix site. • Clear instructions for changing system prompt, voice, and any AI settings.
Deliverables: • Fully functional prototype that runs in Chrome and Edge without extra installs. • Code files and documentation for setup and configuration.
Nice-to-Have (future scope, not required now): • Session recording and playback • Multiple AI personas selectable at start • Visual indicators when AI is listening or speaking
This is a short project — I’m looking for a working prototype ready for testing within a week.
NeoForge Limestone Block Variants Category: Game Design, Game Development, Game Testing, Java, JSON, Minecraft, Software Development, Software Engineering Budget: $250 - $750 CAD
24-Aug-2025 10:04 GMT
Multilingual Hotel Investment Website Category: Content Management System (CMS), Graphic Design, HTML, SEO, Web Design, Web Development, WordPress Budget: €2 - €6 EUR
Document Proofreading in Spanish Language Category: Content Writing, Copy Editing, Editing, English (US) Translator, Language Tutoring, Linguistics, Proofreading, Spanish Translator, Spanish Tutoring, Translation Budget: ₹600 - ₹1500 INR
24-Aug-2025 10:00 GMT
US Footwear Sales Strategy Development Category: Brand Management, Business Analysis, Business Development, Digital Marketing, Internet Marketing, Market Analysis, Market Research, Marketing Budget: $8 - $15 USD
Application development Category: API Development, Backend Development, Cloud Development, Database Management, Mobile App Development, Python, Software Development, User Interface / IA, Web Application Budget: ₹250000 - ₹500000 INR
24-Aug-2025 09:58 GMT
Zen Cart Payment Module Development Category: API Integration, ECommerce, MySQL, Payment Processing, PHP, Software Architecture, Software Development, Zen Cart Budget: $20 - $100 USD
24-Aug-2025 09:56 GMT
Crypto Grabber Development Category: API Integration, Blockchain, C, Programming, Cryptocurrency, JavaScript, PHP, Software Architecture, Software Development Budget: ₹600 - ₹1500 INR
24-Aug-2025 09:54 GMT
Hindi Love Song Writer Category: Audio Production, Audio Services, Creative Writing, Hindi Translator, Music, Music Production, Poetry, Romance Writing Budget: ₹750 - ₹1250 INR
24-Aug-2025 09:52 GMT
Organic Cooking Channel Subscriber Boost Category: Content Creation, Content Marketing, Facebook Marketing, SEO, Social Media Management, Social Media Marketing, Twitter, Video Editing, YouTube Budget: ₹600 - ₹1500 INR
24-Aug-2025 09:50 GMT
Seeking a talented Native Spanish Subtitle Translator Category: Castilian Spanish Translator, Editing, English (US) Translator, Language Tutoring, Linguistics, Proofreading, Spanish Translator, Subtitles & Captions, Translation, Video Services Budget: $250 - $750 USD
24-Aug-2025 09:50 GMT
Twitter Sentiment Analysis with AI Category: Data Mining, HTML, Java, Machine Learning (ML), Natural Language Processing, NumPy, Pandas, Python, Sentiment Analysis, Streamlit Budget: ₹12500 - ₹37500 INR