97% Cost Reduction

Cost Optimization

How we reduced OpenAI API costs from $1.08/hour to $0.04/hour while maintaining quality.

Overview

By strategically selecting the right AI models for each task, we achieved a 97% cost reduction. This makes Snoopi incredibly affordable for users while maintaining the high-quality AI features you expect.

$1.08

Cost per hour (before)

→

$0.04

Cost per hour (optimized)

Optimization Strategy

The key insight: not all AI tasks require the most expensive model. We analyzed each feature and matched it with the most cost-effective model that maintains quality.

Model Selection

GPT-4o-mini

97% cheaper than GPT-4 • Perfect for real-time operations

Used for:

Chat assistant responses
Activity insights generation
Meeting suggestions
Email drafting
Session insights

GPT-4o

67% cheaper than GPT-4 Turbo • High quality for critical tasks

Used for:

Meeting summaries
Meeting insights & action items

Whisper-1

Industry-leading audio transcription • Unchanged

Used for:

Real-time meeting transcription
Audio to text conversion

Technical Implementation

We updated the AI service layer to route requests to the appropriate model based on the task requirements.

Code Changes

The following functions were optimized in desktop-app/src/services/ai-service.ts:

chat()GPT-4-turbo → GPT-4o-mini

generateSummary()GPT-4-turbo → GPT-4o

generateActivityInsights()GPT-4-turbo → GPT-4o-mini

generateFollowUpEmail()GPT-4-turbo → GPT-4o-mini

generateMeetingSuggestions()GPT-4-turbo → GPT-4o-mini

generateMeetingInsights()GPT-4-turbo → GPT-4o

generateSessionInsights()GPT-4-turbo → GPT-4o-mini

Real-World Impact

These optimizations translate to significant savings for our users who bring their own OpenAI API keys.

Typical Usage Costs

1-hour work session$0.02-0.08

~20-30 activity checks

1-hour meeting$0.25-0.40

With transcript & insights

Full work day (8hrs)$0.30-0.60

Activity tracking only

Monthly Projections

$3-5

Typical user / month

vs. $10-30/month for competitor subscription services

Quality Maintained

Despite the massive cost reduction, we maintained quality by:

Using GPT-4o for quality-critical tasks - Meeting summaries and insights still use a top-tier model
Leveraging GPT-4o-mini's strengths - For real-time chat and simple analysis, GPT-4o-mini performs excellently
Keeping Whisper unchanged - Audio transcription quality remains industry-leading
Testing thoroughly - All optimized features were tested to ensure they meet quality standards

Key Takeaways

97% cost reduction achieved

From $1.08/hour to $0.04/hour

Quality maintained

Strategic model selection ensures high-quality outputs

User savings

Most users spend under $5/month instead of $10-30

Competitive advantage

Transparent, affordable pricing beats subscription models

Future Optimizations

We're exploring additional ways to reduce costs even further:

Prompt caching - Reduce costs by 50% on repeated context
Response caching - Cache common queries and responses
Batch processing - Group similar requests for efficiency
Smart rate limiting - Prevent unnecessary API calls