Cost Optimization
How we reduced OpenAI API costs from $1.08/hour to $0.04/hour while maintaining quality.
Overview
By strategically selecting the right AI models for each task, we achieved a 97% cost reduction. This makes Snoopi incredibly affordable for users while maintaining the high-quality AI features you expect.
Cost per hour (before)
Cost per hour (optimized)
Optimization Strategy
The key insight: not all AI tasks require the most expensive model. We analyzed each feature and matched it with the most cost-effective model that maintains quality.
Model Selection
GPT-4o-mini
97% cheaper than GPT-4 • Perfect for real-time operations
Used for:
- Chat assistant responses
- Activity insights generation
- Meeting suggestions
- Email drafting
- Session insights
GPT-4o
67% cheaper than GPT-4 Turbo • High quality for critical tasks
Used for:
- Meeting summaries
- Meeting insights & action items
Whisper-1
Industry-leading audio transcription • Unchanged
Used for:
- Real-time meeting transcription
- Audio to text conversion
Technical Implementation
We updated the AI service layer to route requests to the appropriate model based on the task requirements.
Code Changes
The following functions were optimized in desktop-app/src/services/ai-service.ts:
Real-World Impact
These optimizations translate to significant savings for our users who bring their own OpenAI API keys.
Typical Usage Costs
~20-30 activity checks
With transcript & insights
Activity tracking only
Monthly Projections
Typical user / month
vs. $10-30/month for competitor subscription services
Quality Maintained
Despite the massive cost reduction, we maintained quality by:
- Using GPT-4o for quality-critical tasks - Meeting summaries and insights still use a top-tier model
- Leveraging GPT-4o-mini's strengths - For real-time chat and simple analysis, GPT-4o-mini performs excellently
- Keeping Whisper unchanged - Audio transcription quality remains industry-leading
- Testing thoroughly - All optimized features were tested to ensure they meet quality standards
Key Takeaways
97% cost reduction achieved
From $1.08/hour to $0.04/hour
Quality maintained
Strategic model selection ensures high-quality outputs
User savings
Most users spend under $5/month instead of $10-30
Competitive advantage
Transparent, affordable pricing beats subscription models
Future Optimizations
We're exploring additional ways to reduce costs even further:
- Prompt caching - Reduce costs by 50% on repeated context
- Response caching - Cache common queries and responses
- Batch processing - Group similar requests for efficiency
- Smart rate limiting - Prevent unnecessary API calls