BisonNotes AI
Complete User Guide
v1.8Contents
Getting Started
First Launch Setup
- Install the App — Download BisonNotes AI from the App Store.
- Simple Settings Welcome Screen — You’ll see a streamlined setup with three options:
Initial Setup Options
- OpenAI (Cloud): Cloud-based transcription and AI summaries. Requires OpenAI API key. Most powerful option with pay-per-use pricing.
- On-Device AI: Private, on-device processing. No data leaves your device. Requires download of AI models (2-3GB each) and Parakeet transcription model. Best for recordings under 60 minutes.
- Advanced & Other Options: Skip initial setup and configure later. Access to all engines: Google AI Studio, AWS Bedrock, Mistral AI, OpenAI Compatible, Ollama.
Tip: The simple settings page automatically detects your current configuration. If you’ve configured something in advanced settings that doesn’t match the simple options, it will automatically show “Advanced & Other Options”. - Location Permission — The app will ask for location access if enabled:
- “Allow While Using App”: Recommended — captures location during recording
- “Don’t Allow”: You can still add locations manually later
- Automatic Migration — On first launch, the app scans for existing audio files and migrates them into the database.
Your First Recording
- Start Recording — Tap the large microphone button on the main screen.
- Microphone Permission — On your first recording, iOS will ask for microphone access. Tap “OK”. If denied, re-enable in Settings → Privacy & Security → Microphone.
- Recording Status — You’ll see a red recording indicator, live timer, and location indicator (if enabled).
- Stop Recording — Tap the stop button to end.
- Background Recording — The app continues recording when minimized or the phone is locked.
First Transcript & Summary
Generate Your First Transcript
- Access Recording — After stopping, you’ll see it in the recordings list.
- Start Transcription — Tap the recording, then tap “Generate Transcript”. The app processes your audio using your selected engine.
- Progress — You’ll see processing status and time remaining. You can continue using the app while it processes.
- View Results — Full transcript with editable text, timestamps (if supported), and confidence indicators.
Generate Your First Summary
- Prerequisites — You need a transcript first.
- Start Summary — In the recording detail view, tap “Generate Summary”.
- AI Processing — The app analyzes your transcript and creates:
- Enhanced Summary: Main content overview
- Action Items: Tasks with priority levels
- Reminders: Time-sensitive items with urgency indicators
- Alternative Titles: AI-generated recording names
- Review Results — Expandable sections, priority indicators, interactive maps, and integration options.
iCloud Sync Setup
- “Enable iCloud Sync” — Summaries sync across all your devices. Uses your iCloud storage quota.
- “Keep Local Only” — Summaries stay on this device only. Better for privacy-sensitive content. Can be changed later.
Managing & Deleting Recordings
Long press on any recording, or tap and use the “…” menu. Deletion options:
- Audio File Only — Keeps transcript/summary, removes audio. Good for saving storage.
- Everything — Removes audio, transcript, and summary. Cannot be undone.
- Summary Only — Keeps audio and transcript. Useful for regenerating with a different engine.
Recording Features
iPhone Action Button Integration
Start recording instantly without opening the app first — perfect for capturing thoughts, meetings, or voice notes.
How to Configure the Action Button
- Open the Settings app on your iPhone.
- Navigate to Action Button.
- Select Shortcut as the Action Button function.
- Tap “Choose a Shortcut” → Search for “Start Recording” or “BisonNotes AI” → Select it.
- Press the Action Button to test!
What happens: The app launches, switches to the Recordings tab, and starts recording immediately. Recording continues even if you switch apps or lock your phone.
Location Tracking
- Automatic — GPS location captured with each recording
- Manual — Add or edit location later in the summary view
- Privacy — Location tracking can be disabled in settings
Import Existing Audio
- Tap “Import Audio Files” on the main screen.
- Select audio files from your device.
- Files are automatically added to your recordings library.
Import via Share Extension
- Open the source app — Voice Memos, Files, or any app with the audio file.
- Tap Share — Select “BisonNotes AI” from the share sheet.
- Automatic Import — The file is saved and BisonNotes AI opens to import it.
- Background Import — If BisonNotes AI is already running, it detects the new file immediately.
Supported file types:
- Audio: M4A, MP3, WAV, CAF, AIFF, AIF
- Documents: TXT, MD, PDF, DOC, DOCX
Combining Recordings
How to Combine Recordings
- Select recordings — Go to Recordings tab → tap “Select” → check two recordings.
- Tap “Combine” — A “Combine” button appears when two are selected.
- Choose order — The app recommends chronological order. Tap the “First” card to swap if needed.
- Review & confirm — Check combined duration preview, tap “Combine Recordings”.
- After combining — The new recording appears in your list. Original recordings remain unchanged. Generate new transcripts and summaries for the combined file.
Tips:
- Make sure recordings are in the correct chronological order
- Combined file size = sum of both originals
- You’ll need to generate new transcripts/summaries afterward
AI Engine Configuration
1. On-Device AI — Free & Private
Type: Local processing • Cost: Free • Privacy: 100% local • Internet: Only for initial model download
Requirements
- Transcription: iOS 17.0+, 4GB+ RAM. Uses Parakeet (default) for on-device transcription.
- AI Summary (llama.cpp models): iPhone 15 Pro, iPhone 16+, iOS 18.1+
- Storage: 2-3GB per AI model, plus transcription model download
Recommended Models (by RAM)
- 8GB+: Gemma 3n E4B (4.5 GB) — Best overall quality
- 6GB+: Gemma 3n E2B (3.0 GB) — Good quality, smaller size
- 6GB+: Granite Micro (2.1 GB) — Very fast processing
Experimental Models (enable in settings)
- 8GB+: Granite (4.3 GB) — Reliable and accurate
- 6GB+: Ministral (2.1 GB) — Best for tasks/reminders
- 4GB+: LFM 2.5 Thinking (731 MB) — Fast, minimal summaries
- 6GB+: Qwen3.5 2B (1.3 GB) — Thinking mode, summary only
- 8GB+: Qwen3.5 4B (2.7 GB) — Thinking mode, excellent detail
Best for: Privacy-conscious users, offline use, recordings under 60 minutes
2. OpenAI — Cloud AI
Type: Cloud • Cost: Pay-per-use • Privacy: Data sent to OpenAI • Internet: Required
Setup
- Get API key at platform.openai.com
- Go to Setup → AI Settings → OpenAI
- Enter your API key, select model, test connection.
Summarization Models
| Model | Best For | Tier |
|---|---|---|
| GPT-4.1 Mini | Balanced performance (Default) | Standard |
| GPT-5 Mini | Next-gen reasoning | Premium |
| GPT-5.4 Mini | Latest GPT-5 Mini — improved reasoning | Premium |
Transcription Models
| Model | Notes |
|---|---|
| GPT-4o Transcribe | Most robust. Supports streaming. |
| GPT-4o Mini Transcribe | Cheapest/fastest. Streaming. Recommended. |
| Whisper-1 | Legacy Whisper V2. No streaming. |
Best for: High-quality results, advanced features
3. Google AI Studio — Gemini Models
Type: Cloud • Cost: Free tier, then pay-per-use • Privacy: Data sent to Google • Internet: Required
Setup
- Get API key at aistudio.google.com
- Go to Setup → AI Settings → Google AI Studio
- Enter API key, select model, test connection.
Available Models
- Gemini 3 Flash Preview (Default) — Fast and efficient
- Gemini 3.1 Flash Lite Preview — Lightweight variant for quick processing
Best for: Balanced performance and cost, with free tier for getting started
4. OpenAI API Compatible — LiteLLM, llama.cpp, Groq, etc.
Type: OpenAI-compatible endpoint • Cost: Varies • Privacy: Depends on provider • Internet: Required unless local
Supported Providers
- LiteLLM — Base URL: your server (e.g.,
http://localhost:4000/v1) - llama.cpp — Base URL:
http://localhost:8080. API key: “no-key” or empty. - Groq — Base URL:
https://api.groq.com/v1. API key from console.groq.com - Nebius — Your Nebius endpoint URL
- Custom — Any OpenAI-compatible endpoint
Setup
- Go to Setup → AI Settings → OpenAI API Compatible
- Enter API key and base URL for your provider.
- Select model (use your provider’s exact model name, e.g.,
llama-3.1-70b-versatilefor Groq). - Test the connection.
./server --model <model.gguf>. Or use Python: pip install 'llama-cpp-python[server]'Best for: Using LiteLLM, llama.cpp, Nebius, Groq, or other compatible services
5. Mistral AI — Summarization & Transcription
Type: Cloud • Cost: Pay-per-use (transcription $0.003/min) • Internet: Required
Summarization Models
- Mistral Large (25.12) — Most capable, 128K context (Premium)
- Mistral Medium (25.08) — Balanced, 128K context (Standard)
- Magistral Medium (25.09) — Economy, 40K context
Transcription
- Voxtral Mini Transcribe — $0.003/min with optional speaker diarization
- Supports MP3, MP4, M4A, WAV, FLAC, OGG, WebM
- Automatic language detection or explicit language code
- Files over 24MB automatically chunked
Setup
- Get API key at console.mistral.ai
- Go to Setup → AI Settings → Mistral AI
- Enter API key, select model, test connection.
- For transcription: Setup → Transcription Settings → select “Mistral AI”.
Best for: Fast summaries and affordable transcription with speaker diarization
6. AWS Bedrock — Enterprise Claude AI
Type: Cloud • Cost: Pay-per-use • Internet: Required
Available Models
- Claude 4.5 Haiku (Default) — Fast, efficient (Standard tier)
- Claude Sonnet 4.5 — Advanced reasoning and analysis (Premium tier)
- Llama 4 Maverick 17B — Meta’s latest, enhanced reasoning (Economy tier)
Setup
- Create an AWS account and enable Bedrock service.
- Create IAM user with Bedrock permissions and generate access keys.
- Go to Setup → AI Settings → AWS Bedrock.
- Enter AWS credentials, select region and model, test connection.
Best for: Enterprise features and compliance
7. Apple Native — Apple Intelligence
Type: On-device • Cost: Free • Privacy: 100% local • Internet: Not required
Requirements
- iOS 26.0 or later
- iPhone 15 Pro or newer (Apple Intelligence–capable device)
- Apple Intelligence enabled in Settings → Apple Intelligence & Siri
Setup
- Go to Setup → AI Settings.
- Select Apple Native from the engine library.
- No API key or model download required — uses the built-in Foundation Model.
Limitations
- Context window is 4,096 tokens. Long transcripts are automatically chunked and summarized in segments.
- Only available on devices that support Apple Intelligence.
Best for: Maximum privacy, no setup, and users with Apple Intelligence–capable devices on iOS 26+
8. Ollama — Local LLM Server
Type: Local server • Cost: Free • Privacy: 100% local • Internet: Not required
Setup
- Install Ollama on your machine.
- Download models:
ollama pull qwen3:30b ollama pull gpt-oss:20b ollama pull mistral-small3.2 - Go to Setup → AI Settings → Ollama.
- Set server URL and port (default:
http://localhost:11434). - Tap refresh to scan for models, select one, test connection.
http://192.168.1.100:11434). Larger models = better results but more RAM.Best for: Privacy, customizable models, offline use
Transcription Configuration
Engine Selection
- Go to Setup → Transcription Settings.
- Select your preferred transcription engine.
- Configure and test the connection.
Available Engines
- On Device (Parakeet) — Default. Free, private, works offline. Requires model download.
- OpenAI — GPT-4o Transcribe, GPT-4o Mini Transcribe, or Whisper-1.
- Mistral AI — Voxtral Mini ($0.003/min) with optional speaker diarization.
- Whisper (Local Server) — Your own Whisper server (REST API or Wyoming protocol).
- AWS Transcribe — Cloud service for long audio files.
- OpenAI API Compatible — Any compatible transcription endpoint.
Transcription Engine Comparison
| Engine | Type | Speakers | Cost |
|---|---|---|---|
| On-Device (Parakeet) | Local | No | Free (Default) |
| OpenAI GPT-4o | Cloud | No | Per token |
| OpenAI Whisper | Cloud | No | Per minute |
| Mistral AI | Cloud | Yes | ~$0.003/min |
| AWS Transcribe | Cloud | Yes | Per minute |
| Whisper (local) | Local | No | Free |
On Device Transcription (Parakeet) — Default
Initial Setup
- Enable: Setup → Transcription Settings → select “On Device” → Parakeet is selected by default.
- Download the model: Tap “Download” to get the Parakeet model.
- Requirements: iOS 17.0+, model download required.
Whisper (Local Server) Setup
- Install Whisper Server:
# Using Docker (recommended) docker run -d -p 9000:9000 \ -e ASR_MODEL=base \ -e ASR_ENGINE=openai_whisper \ onerahmet/openai-whisper-asr-webservice:latest - Configure: Setup → Transcription Settings → Whisper (Local Server). Set server URL and port (default: 9000). Choose protocol (REST API or Wyoming) and model size.
AWS Transcribe Setup
- Create an AWS account and enable Transcribe service.
- Create IAM user with
transcribe:StartTranscriptionJobandtranscribe:GetTranscriptionJobpermissions. - Setup → Transcription Settings → AWS Transcribe.
- Enter access key, secret key, region, language. Test connection.
Large File Processing
- Automatic Chunking — Files over 5 minutes are automatically split
- Progress Tracking — Real-time updates
- Background Processing — Continues when app is minimized
- Configurable Timeouts — Adjustable processing time limits
Working with Summaries
Viewing Summaries
- Tap the “Summaries” tab.
- Browse recordings with AI-generated summaries.
- Tap any summary to view details.
Summary Features
- Expandable Sections — Tap to expand/collapse
- Task Extraction — AI-identified actionable items
- Reminder Detection — Time-sensitive reminders
- Priority Indicators — Color-coded priorities
- Location Maps — Interactive maps showing recording location
Search & Filtering
Search is available in three views:
- Summaries View — Search content, tasks, reminders, titles
- Transcripts View — Search transcript text
- Recordings View — Search by recording name
Tap the search bar, type your terms — results filter in real-time (case-insensitive, partial match).
Date Filters
Tap the filter icon → select start/end dates → tap “Apply”. Clear with the X on the filter banner. Filters combine with search and persist until cleared.
Editing Metadata
Change Title, Date, or Location
Recording Title
Open summary → “Titles” section → “Edit” → enter new title or select from AI alternatives → “Use This Title”.
Custom Date & Time
Open summary → “Recording Date & Time” → “Set Custom Date & Time” → use pickers → “Save”.
Location
Open summary → location section → “Add Location” or “Edit Location”. Choose: Current Location (GPS), Map Selection, or Manual Entry → “Save”.
Audio Playback
Basic Playback
- Go to the “Recordings” tab.
- Tap any recording to play.
- Controls: Play/Pause (center), Skip 15s (sides), Scrub (drag progress bar).
Advanced Playback
- Seek Control — Drag scrubber for precise positioning
- Background Playback — Audio continues when app is minimized
- Audio Session Management — Handles interruptions gracefully
Setup & Configuration
Simple Setup vs Additional Settings
Simple Setup
Appears on first launch. Quick setup for common configurations.
- Auto-detects your current config
- Three quick options: OpenAI, On-Device AI, or Advanced
- Preserves settings when switching
Additional Settings
Full control over all options.
- All 8 AI engines
- All transcription engines
- Microphone selection
- Preferences, iCloud, background processing
Audio Settings
- Quality: Whisper Optimized (22 kHz, 64 kbps AAC) — optimized for voice
- Microphone Selection: Choose from available mics (built-in, Bluetooth, USB). Saved across recordings. Auto-fallback if unavailable.
- Mixed Audio: Record without interrupting system audio
- Background Recording: Continue when app is minimized
AI Settings
- Engine Selection — Choose from all 7 AI engines
- Model Configuration — Temperature, max tokens, etc.
- Connection Testing — Verify API connectivity
- Batch Regeneration — Update all summaries with new engine
Troubleshooting
Recording Problems
- No Audio: Check microphone permissions in Settings → Privacy
- Poor Quality: Adjust audio quality settings, ensure proper mic distance
- Background Recording: Enable in app settings
AI Engine Issues
- Connection Failed: Check internet and API keys
- Timeout Errors: Increase timeout settings
- Authentication Errors: Verify API credentials
Transcription Problems
- No Transcription: Check engine configuration
- Poor Quality: Try a different engine or model
- Large File Issues: Enable chunking for files over 5 minutes
Data Issues
- Missing Recordings: Use Data Migration tools
- Corrupted Data: Clear and re-import data
- Sync Problems: Check iCloud settings
Performance Tips
- Battery: Use local engines when battery is low
- Memory: Close other apps during large file processing
- Storage: Regularly clean up old recordings
- Network: Use local engines to reduce data usage
Advanced Features
Background Processing
- Job Queue — Multiple operations queued and processed in order
- Progress Tracking — Real-time updates with accurate status display
- Error Recovery — Automatic retry for failures
- Stale Job Detection (v1.7) — Intelligent detection and recovery of stuck jobs, with timeouts based on actual processing start time
- Job Cancellation (v1.7) — Properly cancel and interrupt running jobs when the app is backgrounded or closed
- On-Device AI Handling — On-device AI jobs are gracefully interrupted when the app is backgrounded, since GPU processing requires the app to stay open
- Live Transcription (v1.8) — On-device speech-to-text streamed in real time during recording via SFSpeechRecognizer; transcript auto-saved when recording stops
- Duplicate Summary Cleanup (v1.8) — Automatically detects and removes duplicate summaries; manual cleanup available in Database Tools
File Management
- Import/Export — M4A, MP3, WAV, CAF, AIFF, AIF
- Audio Export (v1.8) — Share any recording as an audio file via the iOS share sheet
- Video Import (v1.8) — Import video files; audio is automatically extracted to M4A
- Audio Cleanup (v1.8) — Optional pre-transcription DSP: high-pass filter, noise gate, dynamic normalization, peak limiting
- Share Extension — Import from Voice Memos, Files, etc.
- Combining Recordings — Merge two into one
- PDF Export — Professional reports with metadata, maps, page numbers
- File Relationships — Audio/transcript/summary links maintained
- Orphaned File Detection — Find and clean disconnected files
- Selective Deletion — Choose what to keep
Location Intelligence
- GPS Integration — Automatic capture
- Reverse Geocoding — Coordinates to addresses
- Smart Search — 3-tier fallback system
- University Database — Built-in campus mapping
- Interactive Maps — View recording locations
- Manual Location — Add locations after recording
iCloud Backup & Sync
- iCloud Backup — Full backup of all data to iCloud Drive
- Auto-Backup (v1.7) — Automatically backs up when new recordings are created
- CloudKit Sync — Optional summary synchronization across devices with paginated queries and schema-safe fallback for reliability
- Prompted after generating your first summary
Data Migration
- Legacy Import — Migrate from old file-based storage
- Data Integrity — Validate and repair data relationships
- Batch Operations — Process multiple files at once
- Progress Tracking — Monitor migration progress
Best Practices
Recording
- Quiet environment
- Mic 6-12 inches away
- Break long sessions into segments
- Minimize background noise
AI Configuration
- Local engines for sensitive content
- Start free, upgrade as needed
- Experiment with models
- Have a backup engine ready
Data Management
- Export important recordings as PDF
- Use iCloud Backup
- Clean up old recordings
- Use descriptive titles
Performance
- Local engines when battery is low
- Monitor available storage
- Local engines on slow networks
- Close other apps when processing
External Resources
Cloud Services
Local / Self-Hosted
This documentation is regularly updated. For the latest information, check the app’s built-in help.
