Overview
AudioMemo
AudioMemo is a native Android application that records audio, transcribes it using the OpenAI Whisper API, and provides AI-powered summaries. Built with modern Android development practices, it features a sleek intuitive UI and robust background processing capabilities.
Features
- Audio Recording: High-quality voice recording with a real-time audio wavelength visualization UI.
- Smart Chunking: Automatically splits long audio recordings into manageable 30-second chunks and saves them to local storage.
- AI Transcription: Integrates with OpenAI's
whisper-1model to provide accurate transcriptions of recorded audio. - Intelligent Summarization: Uses OpenAI's
gpt-4o-minimodel to generate concise summaries from the transcriptions. - Local Storage: Securely stores audio chunks and transcriptions locally using Room Database.
- Background Processing: Ensures seamless transcription and summarization tasks run efficiently in the background using WorkManager.
Tech Stack
The app is built using the latest Android development technologies and architecture patterns:
- Language: Kotlin
- UI Framework: Jetpack Compose (Material 3)
- Architecture: MVVM (Model-View-ViewModel)
- Concurrency: Coroutines & Flow
- Dependency Injection: Dagger Hilt
- Networking: Retrofit2 & OkHttp
- JSON Serialization: Kotlinx Serialization
- Local Database: Room
- Background Work: WorkManager
Prerequisites
- Android Studio: Latest version recommended (Giraffe or newer).
- Minimum SDK: API 24 (Android 7.0)
- Target SDK: API 34 (Android 14)
- OpenAI API Key: Required for transcription and summarization features.
Getting Started
-
Clone the repository: