How DailyVox actually works.
A deep technical breakdown of the on-device AI pipeline behind the Twin. Nine Apple frameworks. Zero third-party SDKs. No cloud calls. Every layer — capture, transcription, NLP, personality modeling, storage — runs on your phone.
Architecture overview.
Every piece of data in DailyVox flows through a pipeline that runs entirely on the device. There are zero network calls for AI processing. Here is the full system architecture.
The key constraint: data never leaves the device for processing. Transcription runs on the Neural Engine. NLP runs locally. The Twin is computed and stored in Core Data. The only optional network path is Apple's encrypted iCloud sync — which the user can disable.
The on-device stack.
DailyVox uses nine Apple frameworks to build a full AI pipeline without any third-party dependencies or server-side processing.
SFSpeechRecognizer
The primary transcription engine in v1.0 – 1.x. Converts spoken audio to text entirely on-device.
- requiresOnDeviceRecognition = true ensures zero network transmission
- Input: AAC audio at 44.1 kHz via AVAudioEngine
- 60+ languages with on-device models
- Real-time partial results for live feedback
- Runs on Apple Neural Engine
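The on-device constraint above can be sketched in a few lines. This is a minimal illustration, not DailyVox's actual code: the helper names `makeOnDeviceRequest` and `canTranscribeLocally` are hypothetical, while the properties they touch come from Apple's Speech framework.

```swift
import Speech

/// Builds a streaming recognition request pinned to on-device models.
/// (Hypothetical helper name, used here for illustration only.)
func makeOnDeviceRequest() -> SFSpeechAudioBufferRecognitionRequest {
    let request = SFSpeechAudioBufferRecognitionRequest()
    // Refuse any server-side fallback: audio must be recognized locally.
    request.requiresOnDeviceRecognition = true
    // Deliver partial transcripts while the user is still speaking.
    request.shouldReportPartialResults = true
    return request
}

/// Returns true only when the recognizer can honour an on-device-only request.
func canTranscribeLocally(_ recognizer: SFSpeechRecognizer?) -> Bool {
    guard let recognizer else { return false }
    return recognizer.isAvailable && recognizer.supportsOnDeviceRecognition
}
```

Checking `supportsOnDeviceRecognition` first matters because a request that requires on-device recognition fails on a locale without a local model, rather than silently falling back to the network.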
SpeechAnalyzer
Apple's next-generation speech recognition framework, replacing SFSpeechRecognizer in v2.0.
- Significantly faster recognition, lower latency
- Native long-form audio without session timeouts
- No user setup required (no permission prompts for on-device)
- Volatile results for instant partial feedback
- Built for sustained recording — ideal for journaling
NLTagger
The core NLP engine that extracts meaning from transcribed text. Runs multiple analysis passes per entry.
- Sentiment scoring — sentence-level valence −1.0 → +1.0
- Named Entity Recognition — people, places, organisations, dates
- Part-of-Speech tagging — verb density, adjective richness
- Language ID — auto-detect entry language
- All tag schemes use on-device CoreML models
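As a rough sketch of two of these passes, the code below scores each sentence for sentiment and extracts named entities. The function and type names are assumptions made for illustration. Splitting into sentences with NLTokenizer and scoring each one separately is one possible approach, not necessarily the one DailyVox uses.

```swift
import NaturalLanguage

struct SentenceSentiment {
    let sentence: String
    let score: Double   // -1.0 (negative) ... +1.0 (positive)
}

/// Splits text into sentences, then scores each sentence on its own.
func sentenceSentiments(in text: String) -> [SentenceSentiment] {
    let tokenizer = NLTokenizer(unit: .sentence)
    tokenizer.string = text
    return tokenizer.tokens(for: text.startIndex..<text.endIndex).map { range in
        let sentence = String(text[range])
        let tagger = NLTagger(tagSchemes: [.sentimentScore])
        tagger.string = sentence
        // The tag's raw value is the score rendered as a string.
        let (tag, _) = tagger.tag(at: sentence.startIndex, unit: .paragraph, scheme: .sentimentScore)
        return SentenceSentiment(sentence: sentence, score: Double(tag?.rawValue ?? "") ?? 0)
    }
}

/// Collects people, places, and organisations mentioned in the text.
func namedEntities(in text: String) -> [(name: String, kind: NLTag)] {
    let tagger = NLTagger(tagSchemes: [.nameType])
    tagger.string = text
    let wanted: Set<NLTag> = [.personalName, .placeName, .organizationName]
    var found: [(name: String, kind: NLTag)] = []
    tagger.enumerateTags(in: text.startIndex..<text.endIndex,
                         unit: .word,
                         scheme: .nameType,
                         options: [.omitPunctuation, .omitWhitespace, .joinNames]) { tag, range in
        if let tag, wanted.contains(tag) {
            found.append((name: String(text[range]), kind: tag))
        }
        return true   // keep enumerating
    }
    return found
}
```

The `.joinNames` option keeps multi-word names such as "New York" together as a single entity.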
NLEmbedding
Generates dense vector representations of journal entries for semantic search and clustering.
- 512-dim sentence embeddings per entry
- Cosine similarity for semantic search
- K-means clustering for hidden thematic groupings
- Foundation for v2.0's RAG retrieval layer
- Vectors persisted in Core Data alongside entry text
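A minimal sketch of the similarity step, assuming English-language entries. `cosineSimilarity` and `similarity(between:and:)` are illustrative names; `NLEmbedding.sentenceEmbedding(for:)` and `vector(for:)` are the framework calls.

```swift
import Foundation
import NaturalLanguage

/// Cosine of the angle between two equal-length vectors; 0 if either is all zeros.
func cosineSimilarity(_ a: [Double], _ b: [Double]) -> Double {
    precondition(a.count == b.count, "vectors must have equal length")
    let dot = zip(a, b).reduce(0.0) { $0 + $1.0 * $1.1 }
    let normA = a.reduce(0.0) { $0 + $1 * $1 }.squareRoot()
    let normB = b.reduce(0.0) { $0 + $1 * $1 }.squareRoot()
    guard normA > 0, normB > 0 else { return 0 }
    return dot / (normA * normB)
}

/// Embeds two texts on-device and compares them.
/// Returns nil when no sentence embedding is available for English.
func similarity(between first: String, and second: String) -> Double? {
    guard let embedding = NLEmbedding.sentenceEmbedding(for: .english),
          let a = embedding.vector(for: first),
          let b = embedding.vector(for: second) else { return nil }
    return cosineSimilarity(a, b)
}
```

Semantic search then reduces to embedding the query once and ranking the stored entry vectors by this score.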
Foundation Models
Apple's on-device large language model, enabling conversational Twin interactions in v2.0.
- LanguageModelSession: multi-turn conversation with transcript memory
- Tool calling: Twin autonomously queries Core Data via custom Tool protocol
- @Generable: type-safe structured outputs (mood reports as Swift structs)
- streamResponse(): real-time streaming chat UI
- Dynamic instructions from TwinEngine for tone matching
- Requires iPhone 15 Pro+. Entire pipeline on-device.
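The structured-output path might look roughly like the sketch below. The `MoodReport` fields, the `moodReport(for:tone:)` helper, and the prompt wording are all assumptions for illustration. The sketch also assumes a deployment target of iOS 26 or macOS 26, where the FoundationModels framework is available.

```swift
import FoundationModels

/// A hypothetical structured result; the real schema is not shown in this document.
@Generable
struct MoodReport {
    @Guide(description: "One-sentence summary of the entry's mood")
    var summary: String

    @Guide(description: "Valence from -1.0 (negative) to 1.0 (positive)")
    var valence: Double
}

/// Asks the on-device model for a typed mood report.
/// Returns nil when the system model is unavailable on this device.
func moodReport(for entry: String, tone: String) async throws -> MoodReport? {
    guard SystemLanguageModel.default.isAvailable else { return nil }
    // Instructions play the role of the "dynamic instructions" for tone matching.
    let session = LanguageModelSession(
        instructions: "You are the user's journaling Twin. Match this tone: \(tone)"
    )
    let response = try await session.respond(
        to: "Summarise the mood of this entry: \(entry)",
        generating: MoodReport.self
    )
    return response.content
}
```

Because the result is decoded into a Swift struct, the UI can bind to `summary` and `valence` directly without parsing free-form model text.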
Core Data + CloudKit
Local-first persistence with optional encrypted cloud sync across devices.
- SQLite wrapped by NSPersistentCloudKitContainer
- Local-first: app works fully offline
- AIState entity stores the Twin as Codable JSON
- iCloud sync uses Apple's encrypted infrastructure
- Sync is optional — user can disable entirely
CryptoKit
Authenticated encryption for backup exports and sensitive data at rest.
- AES-256-GCM authenticated encryption for backups
- User passphrase for key derivation
- Encrypted JSON export for device migration
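A hedged sketch of a passphrase-protected backup. The source does not say which key-derivation function DailyVox uses; PBKDF2-HMAC-SHA256 via CommonCrypto is assumed here, and the function names, `BackupError`, and the iteration count are illustrative.

```swift
import CommonCrypto
import CryptoKit
import Foundation

enum BackupError: Error { case keyDerivationFailed, sealingFailed }

/// Stretches a passphrase into a 256-bit key (assumed KDF: PBKDF2-HMAC-SHA256).
func deriveKey(passphrase: String, salt: [UInt8], rounds: UInt32 = 600_000) throws -> SymmetricKey {
    let keyLength = 32   // 32 bytes = AES-256
    var derived = [UInt8](repeating: 0, count: keyLength)
    let status = CCKeyDerivationPBKDF(
        CCPBKDFAlgorithm(kCCPBKDF2),
        passphrase, passphrase.utf8.count,
        salt, salt.count,
        CCPseudoRandomAlgorithm(kCCPRFHmacAlgSHA256),
        rounds,
        &derived, keyLength)
    guard status == Int32(kCCSuccess) else { throw BackupError.keyDerivationFailed }
    return SymmetricKey(data: derived)
}

/// Seals the export with AES-GCM; the result bundles nonce, ciphertext, and tag.
func encryptBackup(_ json: Data, with key: SymmetricKey) throws -> Data {
    let box = try AES.GCM.seal(json, using: key)
    guard let combined = box.combined else { throw BackupError.sealingFailed }
    return combined
}

/// Opens a sealed export; throws if the key is wrong or the data was tampered with.
func decryptBackup(_ blob: Data, with key: SymmetricKey) throws -> Data {
    try AES.GCM.open(AES.GCM.SealedBox(combined: blob), using: key)
}
```

The salt must be random per backup and stored alongside the ciphertext, since the same passphrase and salt are needed to re-derive the key on the destination device.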
LocalAuthentication
Biometric authentication gating access to journal entries.
- Face ID and Touch ID via LAContext
- Biometric data is matched inside the Secure Enclave and never exposed to the app
- App lock with configurable auto-lock timeout
- Falls back to device passcode
WidgetKit + AppIntents
Home & Lock-screen widgets. AppIntents-powered Siri Shortcuts for hands-free entries.
- Mood and streak widgets
- Hands-free voice entry via Siri
- AppIntents make Siri aware of DailyVox operations
The Twin Engine.
The TwinEngine is a custom personality-modeling system that builds a multi-dimensional profile of the user from their voice journal entries. It uses no external models or APIs. The entire model is computed from NLTagger output and stored as serialized JSON in Core Data's AIState entity.
The engine consists of four interconnected sub-models.
CommunicationStyle
How the user expresses themselves. Updated with each entry.
- Type-Token Ratio (vocabulary richness)
- Expressiveness score (0 – 1)
- Directness score (0 – 1)
- Formality score (0 – 1)
- Signature words + frequency map
- Average sentence length
- Pronoun patterns (I vs we)
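Two of these metrics are simple enough to sketch directly. The tokenization below (splitting on non-alphanumeric characters and on `.`, `!`, `?`) is a deliberate simplification and an assumption; a production version would more likely use NLTokenizer. `StyleMetrics` and `styleMetrics(for:)` are illustrative names.

```swift
import Foundation

struct StyleMetrics {
    let typeTokenRatio: Double          // unique words / total words
    let averageSentenceLength: Double   // words per sentence
}

func styleMetrics(for text: String) -> StyleMetrics {
    // Naive word split: contractions such as "don't" become two tokens.
    let words = text.lowercased()
        .components(separatedBy: CharacterSet.alphanumerics.inverted)
        .filter { !$0.isEmpty }
    // Naive sentence split on terminal punctuation.
    let sentences = text
        .components(separatedBy: CharacterSet(charactersIn: ".!?"))
        .filter { !$0.trimmingCharacters(in: .whitespacesAndNewlines).isEmpty }
    guard !words.isEmpty, !sentences.isEmpty else {
        return StyleMetrics(typeTokenRatio: 0, averageSentenceLength: 0)
    }
    return StyleMetrics(
        typeTokenRatio: Double(Set(words).count) / Double(words.count),
        averageSentenceLength: Double(words.count) / Double(sentences.count)
    )
}
```

For "The cat sat. The dog sat." this gives four unique words out of six (a ratio of about 0.67) and three words per sentence.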
EmotionalSignature
The user's emotional baseline and patterns over time.
- Valence baseline (positive / negative)
- Arousal baseline (energy level)
- Dominance baseline (control feeling)
- Morning vs evening mood patterns
- Weekday vs weekend patterns
- Trigger topics with correlation scores
- Emotional volatility index
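The source does not define how the baseline or the volatility index is calculated. One plausible reading, sketched here as an assumption, treats the baseline as the mean valence across entries and volatility as the population standard deviation around it.

```swift
import Foundation

/// Mean valence across entries: a simple stand-in for the valence baseline.
func valenceBaseline(_ scores: [Double]) -> Double {
    guard !scores.isEmpty else { return 0 }
    return scores.reduce(0.0, +) / Double(scores.count)
}

/// Population standard deviation of valence, used here as the volatility index.
/// Returns 0 when there are fewer than two entries to compare.
func volatilityIndex(_ scores: [Double]) -> Double {
    guard scores.count > 1 else { return 0 }
    let mean = valenceBaseline(scores)
    let sumOfSquares = scores.reduce(0.0) { $0 + ($1 - mean) * ($1 - mean) }
    return (sumOfSquares / Double(scores.count)).squareRoot()
}
```

The same two functions can be applied to filtered subsets (mornings only, weekends only) to produce the time-based patterns in the list.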
PersonalKnowledgeGraph
A network of people, places, and topics with emotional weights.
- NER-extracted entities (person, place, org)
- Emotional weight per entity (−1 → +1)
- Mention frequency over time
- Co-occurrence relationships
- Entity–mood correlation tracking
- Topic clusters from entity groupings
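A small sketch of how such a graph could accumulate per-entry observations. The data layout (`EntityNode`, a string-keyed co-occurrence table) and the choice of mean valence as the emotional weight are assumptions for illustration, not the TwinEngine's actual schema.

```swift
import Foundation

struct EntityNode: Codable {
    var mentions = 0
    var valenceSum = 0.0
    /// Mean valence of the entries that mention this entity (-1 ... +1).
    var emotionalWeight: Double {
        mentions == 0 ? 0 : valenceSum / Double(mentions)
    }
}

struct KnowledgeGraph {
    private(set) var nodes: [String: EntityNode] = [:]
    /// Keyed by "a|b" with a < b, so each unordered pair is counted once.
    private(set) var coOccurrences: [String: Int] = [:]

    /// Records one journal entry: its extracted entities and overall valence.
    mutating func record(entities: Set<String>, valence: Double) {
        for name in entities {
            nodes[name, default: EntityNode()].mentions += 1
            nodes[name, default: EntityNode()].valenceSum += valence
        }
        let sorted = entities.sorted()
        for (index, a) in sorted.enumerated() {
            for b in sorted[(index + 1)...] {
                coOccurrences["\(a)|\(b)", default: 0] += 1
            }
        }
    }
}
```

Each entry touches only the entities it mentions, so the graph can be updated incrementally without reprocessing older entries.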
TwinPredictions
Forecasts based on temporal pattern analysis.
- Day-of-week mood forecasting
- Time-of-day emotional patterns
- Trend direction (improving / declining)
- Seasonal pattern detection
- Trigger anticipation from schedule
- Confidence scores per prediction
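Day-of-week forecasting can be illustrated with a per-weekday average. Everything below is an assumption: the source does not describe the forecasting model, and the confidence heuristic `n / (n + 4)` is invented purely to show a score that grows with the number of samples.

```swift
import Foundation

struct MoodForecast {
    let expectedValence: Double
    let confidence: Double   // 0 ... 1, grows with the number of matching entries
}

/// Forecasts mood for a weekday (1 = Sunday ... 7 = Saturday in the Gregorian calendar)
/// as the mean valence of past entries recorded on that weekday.
func forecast(forWeekday weekday: Int,
              from history: [(date: Date, valence: Double)],
              calendar: Calendar = .current) -> MoodForecast? {
    let matching = history
        .filter { calendar.component(.weekday, from: $0.date) == weekday }
        .map { $0.valence }
    guard !matching.isEmpty else { return nil }
    let mean = matching.reduce(0.0, +) / Double(matching.count)
    // Hypothetical heuristic: approaches 1 as samples accumulate.
    let confidence = Double(matching.count) / Double(matching.count + 4)
    return MoodForecast(expectedValence: mean, confidence: confidence)
}
```

Returning nil for weekdays with no history keeps the engine from presenting a forecast it has no evidence for.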
All four sub-models are Codable structs, serialized to JSON and stored in a single Core Data entity (AIState). The entire personality model can be loaded in one fetch, updated incrementally, and synced across devices as a single atomic object. No external database. No vector store until v1.4. Just Core Data.
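The storage pattern can be sketched as a Codable round trip. `TwinSnapshot` and its fields are a hypothetical, heavily reduced stand-in for the real model; the resulting `Data` is what would be written to a binary attribute on the AIState entity.

```swift
import Foundation

/// Hypothetical, reduced stand-in for the full Twin model.
struct TwinSnapshot: Codable, Equatable {
    var signatureWords: [String: Int]
    var valenceBaseline: Double
    var entryCount: Int

    /// Incremental update: folds one new entry into the running mean.
    mutating func absorb(valence: Double) {
        valenceBaseline = (valenceBaseline * Double(entryCount) + valence) / Double(entryCount + 1)
        entryCount += 1
    }
}

/// Serializes the whole model to one JSON blob.
func encodeTwin(_ twin: TwinSnapshot) throws -> Data {
    try JSONEncoder().encode(twin)
}

/// Restores the model from the stored blob in a single decode.
func decodeTwin(from data: Data) throws -> TwinSnapshot {
    try JSONDecoder().decode(TwinSnapshot.self, from: data)
}
```

Keeping the model in one blob is what makes the single-fetch load possible; the trade-off is that every update rewrites the entire object.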
Privacy architecture.
Privacy is not a feature of DailyVox. It is the architectural constraint every technical decision is built around. The system is designed so that private data physically cannot leave the device for processing.
Zero network processing
Every AI operation runs on the device, using the Neural Engine where the framework supports it. Transcription uses requiresOnDeviceRecognition = true. NLTagger runs locally. The Twin is computed and stored in Core Data. There are no API calls, no cloud functions, no telemetry on journal content.
No third-party SDKs
DailyVox contains zero third-party dependencies for core functionality. No analytics SDKs. No crash reporting that sends journal content. No ad networks. The only external code is Google Analytics on this website (not in the app) and Apple's own frameworks.
Apple's "Data Not Collected"
DailyVox carries Apple's "Data Not Collected" privacy label on the App Store. This is the strictest category — the app collects no data, linked or unlinked to the user's identity.
Cloud AI journal vs DailyVox
| | typical cloud AI journal | DailyVox |
|---|---|---|
| audio processing | sent to cloud servers | on-device Neural Engine |
| AI model location | remote API (OpenAI etc) | Apple on-device models |
| text analysis | cloud NLP service | NLTagger (local) |
| data storage | company servers | Core Data · SQLite on device |
| account required | yes (email, password) | no |
| third-party SDKs | analytics, crash, ads | none |
| privacy label | "Data Linked to You" | "Data Not Collected" |
| works offline | no | yes, fully |
| subscription | $5–15 / month | free |
| who can read your journal | company, employees, sub-processors | only you |
Technical roadmap.
Where DailyVox has been, what's being built now, and where it's going. Each version adds a layer to the on-device AI stack.
Voice journaling + on-device AI
Core voice journaling with fully on-device transcription, NLP, encrypted storage, biometric lock, widgets, Siri Shortcuts.
Twin + personality model
Custom TwinEngine with communication style, emotional signature, knowledge graph, and temporal mood forecasting.
Ask Your Twin + social sharing
TwinChatView with pattern-matched query system. ShareablePersonalityCardView renders at 3× for Instagram Stories and X. Review prompts via SKStoreReviewController at milestone entries.
Semantic search + proactive insights
NLEmbedding for 512-dim sentence embeddings. Cosine similarity vector search. Z-score anomaly detection. K-means clustering. Foundation for v2.0 RAG.
Desktop Twin + wrist capture + localization
Native macOS target — same SwiftUI codebase, sidebar navigation, Twin accessible from the desktop. Apple Watch companion (WatchKit) for voice mood check-ins with Complications. String Catalogs for multi-language UI.
Foundation Models + tool calling + SpeechAnalyzer
On-device 3B Foundation Model. Tool calling lets the Twin autonomously query Core Data. @Generable for structured output. streamResponse() for streaming. SpeechAnalyzer replaces SFSpeechRecognizer. Requires iPhone 15 Pro+. Zero network calls.
LoRA fine-tuning — Twin learns to sound like you
Apple's Foundation Models Adapter Training toolkit for Low-Rank Adaptation. Export 100–1,000 entries as JSONL. Train a personal adapter on Mac. ~160 MB adapter delivered via Background Assets. Loaded via SystemLanguageModel(adapter:). Training data never leaves your Mac.
The most accurate mirror of yourself
After years of daily entries: a Twin that talks like you, sounds like you (Personal Voice), predicts your reactions, explains causality from past entries, and shows personality evolution over time. Full RAG, personal LoRA adapter, autonomous tool calling. Not a clone — it knows your narrated self, not your complete self. Thoughts you don't journal are invisible to it. Entirely on-device, exportable only by you.
Research context.
DailyVox exists at the intersection of on-device LLMs, personal AI, and mental-health technology. Several recent research papers explore adjacent ideas.
- [1] Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning (2026). Efficient fine-tuning under mobile memory constraints; directly relevant to v2.5 LoRA adapter training.
- [2] MoPHES: On-Device LLMs for Mobile Psychological Health (2025). Using on-device LLMs for psychological health applications; parallel to the Twin's emotional modelling.
- [3] PocketLLM: On-Device Fine-Tuning for Personalized LLMs (2024). Personal model adaptation on mobile hardware; foundational for v2.5's personal adapter.
- [4] PLMM: Personal Large Language Models on Mobile Devices (2023). Early architecture proposal for personal LLMs running on phones; the direction DailyVox pursues.
Open source.
DailyVox is open source. The full codebase (the TwinEngine, all NLP processing, the Core Data stack, the SwiftUI interface) is on GitHub.
Privacy-critical software should be auditable. If you claim data never leaves the device, people should be able to verify that claim by reading the code.
Build with us.
DailyVox is open source and contributions are welcome — improving the Twin engine, adding language support, building the Foundation Models integration.
Try DailyVox.
Free. Private. No account needed. All AI runs on your device.
Download free on the App Store →