97% Cost Reduction

Cost Optimization

How we reduced OpenAI API costs from $1.08/hour to $0.04/hour while maintaining quality.

Overview

By matching each task to the least expensive model that still meets its quality bar, we cut OpenAI API costs by about 97%. This keeps Snoopi affordable for users without degrading its AI features.

Cost per hour (before): $1.08

Cost per hour (optimized): $0.04
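As a quick check on the headline number, the sketch below computes the percentage reduction from the two rounded per-hour figures. The function name is hypothetical. With these rounded inputs the result is about 96.3%; the quoted 97% presumably comes from unrounded costs, which are not given here.

```typescript
// Percentage reduction between two costs.
// The inputs used below are the rounded per-hour figures quoted in the overview.
function percentReduction(before: number, after: number): number {
  if (before <= 0) {
    throw new RangeError("before must be positive");
  }
  return ((before - after) / before) * 100;
}

const reduction = percentReduction(1.08, 0.04);
console.log(`${reduction.toFixed(1)}% reduction`); // prints "96.3% reduction"
```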

Optimization Strategy

The key insight: not all AI tasks require the most expensive model. We analyzed each feature and matched it with the most cost-effective model that maintains quality.

Model Selection

GPT-4o-mini

97% cheaper than GPT-4 • Perfect for real-time operations

Used for:

  • Chat assistant responses
  • Activity insights generation
  • Meeting suggestions
  • Email drafting
  • Session insights

GPT-4o

67% cheaper than GPT-4 Turbo • High quality for critical tasks

Used for:

  • Meeting summaries
  • Meeting insights & action items

Whisper-1

Industry-leading audio transcription • Unchanged

Used for:

  • Real-time meeting transcription
  • Audio to text conversion

Technical Implementation

We updated the AI service layer to route requests to the appropriate model based on the task requirements.

Code Changes

The following functions in desktop-app/src/services/ai-service.ts were switched from GPT-4-turbo to a cheaper model:

  • chat(): GPT-4-turbo → GPT-4o-mini
  • generateSummary(): GPT-4-turbo → GPT-4o
  • generateActivityInsights(): GPT-4-turbo → GPT-4o-mini
  • generateFollowUpEmail(): GPT-4-turbo → GPT-4o-mini
  • generateMeetingSuggestions(): GPT-4-turbo → GPT-4o-mini
  • generateMeetingInsights(): GPT-4-turbo → GPT-4o
  • generateSessionInsights(): GPT-4-turbo → GPT-4o-mini
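One way to express this mapping in code is a single lookup table keyed by task, so every function asks one place which model to use. This is a minimal sketch, not the actual contents of ai-service.ts: the AiTask type, MODEL_FOR_TASK table, and modelForTask helper are hypothetical names, and only the function names and model assignments come from the list above.

```typescript
// Chat models used after the optimization.
type ChatModel = "gpt-4o" | "gpt-4o-mini";

// Hypothetical task identifiers, named after the functions listed above.
type AiTask =
  | "chat"
  | "generateSummary"
  | "generateActivityInsights"
  | "generateFollowUpEmail"
  | "generateMeetingSuggestions"
  | "generateMeetingInsights"
  | "generateSessionInsights";

// Record<AiTask, ChatModel> makes the compiler reject a task with no model assigned.
const MODEL_FOR_TASK: Record<AiTask, ChatModel> = {
  chat: "gpt-4o-mini",
  generateSummary: "gpt-4o",
  generateActivityInsights: "gpt-4o-mini",
  generateFollowUpEmail: "gpt-4o-mini",
  generateMeetingSuggestions: "gpt-4o-mini",
  generateMeetingInsights: "gpt-4o",
  generateSessionInsights: "gpt-4o-mini",
};

// Look up the model for a task; each service function would pass this as its `model`.
function modelForTask(task: AiTask): ChatModel {
  return MODEL_FOR_TASK[task];
}

console.log(modelForTask("generateSummary")); // prints "gpt-4o"
```

Keeping the mapping in one table means a later model change touches a single line rather than seven call sites.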

Real-World Impact

These optimizations translate to significant savings for our users who bring their own OpenAI API keys.

Typical Usage Costs

  • 1-hour work session (~20-30 activity checks): $0.02-0.08
  • 1-hour meeting (with transcript & insights): $0.25-0.40
  • Full work day (8 hrs, activity tracking only): $0.30-0.60

Monthly Projections

Typical user: $3-5 per month, vs. $10-30 per month for competitor subscription services.
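To see how the per-session costs add up to a monthly figure, the sketch below multiplies them by an assumed usage mix. The mix of 40 one-hour work sessions and 8 one-hour meetings per month is a hypothetical example, not measured usage, and all names are illustrative. Amounts are kept in whole cents to avoid floating-point rounding.

```typescript
// A cost range in whole cents.
interface CostRange {
  lowCents: number;
  highCents: number;
}

// Per-session ranges taken from the typical usage costs above.
const WORK_SESSION: CostRange = { lowCents: 2, highCents: 8 };
const MEETING: CostRange = { lowCents: 25, highCents: 40 };

// Monthly range for a given number of one-hour work sessions and one-hour meetings.
function monthlyCost(workSessions: number, meetings: number): CostRange {
  return {
    lowCents: workSessions * WORK_SESSION.lowCents + meetings * MEETING.lowCents,
    highCents: workSessions * WORK_SESSION.highCents + meetings * MEETING.highCents,
  };
}

function formatDollars(cents: number): string {
  return `$${(cents / 100).toFixed(2)}`;
}

// Assumed mix: 40 work sessions and 8 meetings in a month.
const estimate = monthlyCost(40, 8);
console.log(`${formatDollars(estimate.lowCents)}-${formatDollars(estimate.highCents)}`);
// prints "$2.80-$6.40"
```

Under that assumed mix the range brackets the $3-5 figure; heavier meeting use pushes the total up fastest, since meetings cost the most per hour.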

Quality Maintained

Despite the massive cost reduction, we maintained quality by:

  • Using GPT-4o for quality-critical tasks - Meeting summaries and insights still use a top-tier model
  • Leveraging GPT-4o-mini's strengths - For real-time chat and simple analysis, GPT-4o-mini performs excellently
  • Keeping Whisper unchanged - Audio transcription quality remains industry-leading
  • Testing thoroughly - All optimized features were tested to ensure they meet quality standards

Key Takeaways

  • 97% cost reduction achieved - From $1.08/hour to $0.04/hour
  • Quality maintained - Strategic model selection ensures high-quality outputs
  • User savings - Most users spend under $5/month instead of $10-30
  • Competitive advantage - Transparent, affordable pricing beats subscription models

Future Optimizations

We're exploring additional ways to reduce costs even further:

  • Prompt caching - Reduce input-token costs by up to 50% on repeated context
  • Response caching - Cache common queries and responses
  • Batch processing - Group similar requests for efficiency
  • Smart rate limiting - Prevent unnecessary API calls
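Of these, response caching is the simplest to illustrate. The sketch below is a minimal in-memory cache with a time-to-live, keyed by model and prompt. It is one possible design, not a planned implementation: ResponseCache, Fetcher, and the five-minute TTL are all hypothetical, and the fetcher stands in for whatever function actually calls the OpenAI API.

```typescript
// Stand-in for the function that performs the real API request.
type Fetcher = (model: string, prompt: string) => Promise<string>;

interface CacheEntry {
  response: Promise<string>;
  expiresAt: number;
}

class ResponseCache {
  private readonly entries = new Map<string, CacheEntry>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  // `now` is injectable so expiry can be exercised without waiting.
  constructor(ttlMs: number, now: () => number = () => Date.now()) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  getOrFetch(model: string, prompt: string, fetcher: Fetcher): Promise<string> {
    const key = JSON.stringify([model, prompt]);
    const hit = this.entries.get(key);
    if (hit !== undefined && hit.expiresAt > this.now()) {
      return hit.response; // Fresh entry: no API call is made.
    }
    // Store the promise itself so concurrent identical requests share one call.
    const response = fetcher(model, prompt);
    this.entries.set(key, { response, expiresAt: this.now() + this.ttlMs });
    // Drop failed requests so an error is not served from the cache.
    response.catch(() => {
      if (this.entries.get(key)?.response === response) {
        this.entries.delete(key);
      }
    });
    return response;
  }
}

// Usage with a fake fetcher; a real one would call the OpenAI API.
const cache = new ResponseCache(5 * 60 * 1000);
const fakeFetcher: Fetcher = (model, prompt) =>
  Promise.resolve(`[${model}] reply to: ${prompt}`);
cache.getOrFetch("gpt-4o-mini", "Summarize my morning", fakeFetcher).then(console.log);
```

Exact-match caching like this only pays off for prompts that repeat verbatim, so it suits canned queries better than free-form chat.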