We analyzed 40 journal apps and classified them into four tiers based on where their AI actually processes your data. The results are sobering: 48% send plaintext journal entries to cloud APIs, and only 5% run all AI entirely on-device. DailyVox is classified as Tier A — zero cloud egress — making it one of the most private AI-powered apps in any category, not just journaling.
The phrase "AI-powered" has become ubiquitous in the journaling space. Nearly every modern journal app now advertises some form of artificial intelligence — smart prompts, mood analysis, sentiment tracking, personalized insights, or conversational reflection. But "AI-powered" tells you nothing about where that AI actually runs. And that distinction — on your device versus on someone else's server — is the single most important privacy question you can ask about any app that touches your personal thoughts.
Existing privacy frameworks fall short here. Apple's App Store privacy labels tell you what data is collected, but not how AI processes it. GDPR and privacy policies describe legal obligations, but not technical architecture. A journal app can truthfully claim "we don't sell your data" while simultaneously sending every entry to OpenAI's API for processing. The privacy label might even say "Data Not Linked to You" while your plaintext thoughts transit through three different cloud services.
We needed a framework that captures where AI actually runs. So we built one.
Why We Built This Framework
Privacy labels were a major step forward when Apple introduced them in 2020. For the first time, users could see at a glance whether an app collected their data, what categories of data were involved, and whether that data was linked to their identity. But six years later, the labels have a blind spot that matters enormously for journal apps: they do not distinguish between AI that runs on your phone and AI that runs on a server farm in Virginia.
Consider two hypothetical journal apps. App A uses Apple's on-device NaturalLanguage framework to analyze sentiment in your entries. The text never leaves your phone. App B sends your entries to a cloud API, receives a sentiment score back, and discards the server-side copy within 30 days. Both apps could legitimately display "Data Not Linked to You" in the App Store — App B because the server-side data is not permanently linked to an identity. But the privacy implications are vastly different. App A's architecture makes a breach impossible. App B's architecture means your journal entries existed, even temporarily, on a server you do not control.
This gap became impossible to ignore as the AI journaling market exploded in 2025. Dozens of apps launched with GPT-powered features — smart prompts, therapeutic conversations, mood prediction — all requiring journal entries to be sent to cloud APIs. Users had no way to distinguish these apps from the small number of apps running AI entirely on-device. The marketing language was identical: "AI-powered insights," "smart journaling," "personalized reflections." The architecture behind those features could not be more different.
We built the On-Device AI Maturity Model to create that distinction. It classifies apps not by what they promise in a privacy policy, but by what they physically do with your data when AI features are invoked. The framework has four tiers, from A (zero cloud egress) to D (third-party AI processing), and each tier is defined by observable, testable architectural properties.
The model is not just theoretical. We applied it to 40 journal apps using network proxy testing, SDK decompilation, privacy label analysis, and architecture audits. The results were published in our Journal App Privacy Audit 2026, and the findings are stark enough to change how you think about which journal app you trust with your innermost thoughts.
The 4 Tiers Explained
Each tier represents a fundamentally different architecture for how AI interacts with your journal data. The tiers are not a spectrum of "better or worse privacy settings" — they represent structural differences in where data physically exists and who can access it. You cannot move from Tier D to Tier A by toggling a setting. It requires rebuilding the app from the ground up.
Tier A: Zero Cloud Egress
Tier A is the gold standard. All AI processing — transcription, sentiment analysis, mood tracking, insights generation, and any conversational features — runs entirely on the device. There is no server infrastructure. There is no account creation. There are no third-party SDKs. The app makes zero network calls during any AI operation.
What this means in practice: you can put your phone in airplane mode, use every AI feature the app offers, and nothing breaks. No loading spinners, no "requires internet" messages, no degraded functionality. The AI models live on your device, the processing happens on your device, and the results stay on your device. A data breach is not just unlikely — it is architecturally impossible, because there is no centralized data store to breach.
Tier A apps carry Apple's strictest privacy label: "Data Not Collected." Their privacy policies are short — often just a few sentences — because there is nothing to disclose. The trade-off historically was that on-device AI was less capable than cloud AI. But with Apple's CoreML, NaturalLanguage framework, and on-device Speech recognition reaching production quality, that gap has narrowed dramatically. Tier A no longer means "limited AI." It means "AI that respects your privacy by design."
Example: DailyVox. Voice transcription uses Apple's on-device Speech framework. Mood analysis runs on CoreML. Sentiment tracking uses the NaturalLanguage framework. The Digital Twin feature processes locally. Zero network calls. Zero servers. Zero accounts. App Store label: "Data Not Collected."
Tier B: Encrypted Cloud with On-Device AI
Tier B apps run all AI processing on-device, but sync data through an encrypted cloud service. The critical distinction from Tier A is that journal data exists, in encrypted form, on infrastructure the user does not control. The AI itself never touches the cloud — transcription, analysis, and insights all happen locally — but the storage and sync layer introduces a server-side component.
In the best implementations, the cloud sync is controlled by the operating system (like iCloud), not the app developer. The developer cannot access journal content even if they wanted to. End-to-end encryption, when properly implemented through the OS layer, means the data on the server is opaque to everyone except the device that created it.
The privacy posture of Tier B is strong but not absolute. Metadata — sync timestamps, entry counts, device information — may not be encrypted. The encryption implementation is controlled by the platform (Apple, in this case), and changes to that implementation are outside the app developer's control. Legal compulsion directed at the platform provider could theoretically compromise data, though Advanced Data Protection significantly mitigates this risk.
Example: Apple Journal. Journal suggestions and AI features run on-device using Apple's frameworks. Data syncs through iCloud with end-to-end encryption available via Advanced Data Protection. No third-party servers are involved. However, an Apple ID is required, and data transits through Apple's infrastructure. The developer (Apple) controls both the device and the sync layer.
Tier C: Cloud Storage with Policy Protections
Tier C apps store journal data on the developer's servers. Encryption may be offered — sometimes end-to-end, sometimes only in-transit and at-rest — but the fundamental architecture is cloud-first. Your entries live on someone else's computer. AI features may run on-device, in the cloud, or a hybrid of both, but the data itself has already left your device by virtue of the storage architecture.
Privacy in Tier C is contractual, not architectural. The company promises, through privacy policies and terms of service, not to access, sell, or misuse your data. These promises are legally binding but practically fragile. Companies get acquired (Day One was acquired by Automattic in 2021). Privacy policies get updated. Legal orders compel disclosure. Encryption implementations change. A breach exposes the centralized data store that Tier A and B apps do not have.
Tier C apps typically require account creation, which links your identity to your journal. They often embed third-party SDKs for analytics, crash reporting, or attribution. The App Store label usually reads "Data Linked to You." End-to-end encryption, where offered, is frequently optional and may not cover all data categories or metadata.
Examples: Day One, Penzu. Day One stores entries on Automattic's servers with optional end-to-end encryption. Penzu stores entries on their servers with AES-256 encryption. Both require accounts. Both have "Data Linked to You" labels. The encryption is real, but the data lives on infrastructure the user does not control, and metadata is typically not covered by E2E encryption.
Tier D: Third-Party AI Processing
Tier D is the most privacy-invasive tier. Journal entries are sent to third-party AI APIs — typically OpenAI, Google, or Anthropic — for processing. Your most personal thoughts travel from your device, through the app developer's servers, to a third-party AI provider's infrastructure. The plaintext of your journal entry exists, at least temporarily, on servers operated by a company you have no direct relationship with.
The privacy implications are compounded by the nature of the data. Journal entries are not search queries or product reviews — they are unfiltered reflections on mental health, relationships, trauma, career anxiety, and personal struggles. When a Tier D app sends this text to a cloud API, it passes through multiple infrastructure layers, each with its own retention policies, employee access controls, and vulnerability surface.
Many Tier D apps market themselves heavily as "AI-powered" precisely because cloud AI models (like GPT-4) are more capable than most on-device alternatives. The prompts are more sophisticated, the conversations feel more natural, and the insights are more detailed. But that capability comes at the cost of transmitting your most private data to the least private infrastructure. Some Tier D apps do not even disclose which AI provider they use, making it impossible to know whose servers are processing your thoughts.
Tier D apps universally require accounts, embed multiple third-party SDKs, and carry "Data Linked to You" labels with extensive data categories. Many do not function without an internet connection, which is the simplest diagnostic test: if the AI features break in airplane mode, the AI runs in the cloud.
Examples: Rosebud, Reflectly. Rosebud sends entries to GPT-based APIs for conversational reflection and therapeutic-style responses. Reflectly sends entries to cloud AI for mood analysis and personalized prompts. Both require accounts, embed analytics SDKs, and do not function meaningfully offline.
How We Evaluated: Methodology
Classifying 40 apps into four tiers required more than reading privacy policies. Marketing language is unreliable — apps that claim "your data stays private" frequently transmit entries to cloud servers. We used four complementary evaluation methods, each designed to verify architectural claims against observable behavior.
1. Network Proxy Testing
We intercepted all network traffic from each app using Charles Proxy and mitmproxy over a 7-day testing period. For each app, we created multiple journal entries, used every AI feature (transcription, insights, prompts, mood analysis), and logged every network request. We examined request payloads for plaintext journal content, encrypted content, metadata, and third-party API calls. This is the most definitive test: if your journal text appears in a network request, the app is not on-device.
We tested in three conditions: normal Wi-Fi usage, cellular-only, and airplane mode. Airplane mode testing was particularly revealing — Tier A apps work identically, Tier B apps work except for sync, Tier C apps show degraded functionality, and Tier D apps fail to provide AI features entirely.
2. SDK Analysis
We decompiled app binaries to identify every embedded third-party SDK. This reveals which analytics providers, crash reporting tools, attribution trackers, and AI service SDKs are bundled into the app. An app that embeds the OpenAI SDK is almost certainly sending data to OpenAI's servers, regardless of what the privacy policy says. Similarly, analytics SDKs (Firebase, Mixpanel, Amplitude) transmit usage data on every app launch.
SDK analysis also reveals dependencies that are not disclosed in privacy policies. Some apps embed advertising attribution SDKs (AppsFlyer, Adjust) that track install sources and user behavior across apps. Others include A/B testing frameworks that transmit user data for experiment evaluation. These SDKs operate independently of the journal features and represent additional data egress that users are unaware of.
3. Privacy Label Review
We compared each app's self-reported App Store privacy label against our observed network behavior and SDK composition. Apple requires apps to accurately report their data practices, but enforcement is complaint-driven. We found discrepancies in several apps — cases where the privacy label understated the actual data collection we observed through proxy testing and SDK analysis.
We categorized labels on a 4-point scale: "Data Not Collected" (strongest), "Data Not Linked to You," "Data Linked to You," and "Data Used to Track You" (weakest). We then cross-referenced label claims against observed behavior for each data category.
4. Architecture Audit
For each app, we evaluated the fundamental architectural decisions: Does the app require an account? Does it function offline? Where do AI models run? Is there a server component? What encryption is used, and who controls the keys? These structural properties determine the privacy ceiling — no amount of policy language can overcome an architecture that requires cloud processing.
We also examined data export capabilities, deletion mechanisms, and what happens to user data if the app or company shuts down. Tier A apps have inherent data portability because data exists only on the device. Tier D apps often have the weakest export options despite storing the most data.
The Results: 40 Apps Classified
After evaluating 40 journal apps across all four methods, the tier distribution reveals a market that overwhelmingly favors cloud-based architectures despite growing user demand for privacy.
| Tier | Classification | Number of Apps | Percentage | Notable Examples |
|---|---|---|---|---|
| Tier A | Zero Cloud Egress | 2 | 5% | DailyVox |
| Tier B | Encrypted Cloud + On-Device AI | 5 | 12.5% | Apple Journal, Calmplot |
| Tier C | Cloud Storage + Policy Protections | 14 | 35% | Day One, Penzu, Journey, Diarium |
| Tier D | Third-Party AI Processing | 19 | 47.5% | Rosebud, Reflectly, Notion |
The distribution is striking. Nearly half of all journal apps we tested — 47.5% — fall into Tier D, meaning they send journal entries to third-party AI providers for processing. Combined with Tier C, that means 82.5% of journal apps store your data on servers you do not control. Only 17.5% keep all AI processing on-device, and only 5% achieve the zero-cloud-egress standard of Tier A.
The concentration at the bottom of the model reflects market incentives. Cloud AI is easier to build: developers call an API, receive a response, and display it. On-device AI requires deep integration with platform frameworks, model optimization for mobile hardware, and significant engineering investment to match the quality of cloud models. Most startups choose the path of least resistance, which is also the path of least privacy.
Key Findings
Beyond the tier distribution, our analysis revealed several patterns that should concern anyone who journals digitally.
48% Send Plaintext Entries to Cloud APIs
Nearly half of the apps we tested transmit journal entries in plaintext — or functionally plaintext (base64-encoded, JSON-wrapped, but unencrypted at the application layer) — to cloud AI APIs. We confirmed this through network proxy testing: the full text of journal entries appeared in HTTP request bodies destined for OpenAI, Google Cloud, or custom AI endpoints. In several cases, the API requests included not just the current entry but previous entries for "context," meaning a single AI call could transmit weeks of journal history.
Only 5% Qualify as Tier A
Out of 40 apps, only 2 achieved Tier A classification. The barrier is high: zero cloud egress means no server, no account, no third-party SDK, and no network call during any AI feature. Most apps fail on at least one criterion. Some have on-device AI but embed analytics SDKs that transmit usage data. Others run AI locally but require an account for the journal itself. Tier A requires every component — storage, identity, AI, and analytics — to be on-device. DailyVox is one of only two apps that met this standard.
73% of "Private" Apps Still Collect Data
We examined the 15 apps in our sample that explicitly market themselves as "private," "secure," or "encrypted" in their App Store descriptions or marketing pages. Of these 15, 11 (73%) still collect data linked to user identity according to their own App Store privacy labels. The word "private" in app marketing has become effectively meaningless. An app can call itself private while requiring an email account, embedding analytics SDKs, syncing to company servers, and processing entries with cloud AI.
The most common pattern: apps that offer encryption for data at rest but still transmit unencrypted metadata, embed tracking SDKs, and send entries to cloud AI for "smart" features. The encryption protects one layer while other layers leak data freely.
SDK Proliferation Correlates with Privacy Tier
We found a strong correlation between the number of embedded third-party SDKs and an app's privacy tier. Tier A apps averaged 0 third-party SDKs. Tier B apps averaged 1.2. Tier C apps averaged 4.7. Tier D apps averaged 8.3. The relationship is not coincidental — apps that are architecturally willing to send data to their own servers are also architecturally willing to embed SDKs that send data to third parties. The decision to build cloud-first cascades into every subsequent integration choice.
Offline Functionality is the Simplest Diagnostic
The easiest way to estimate an app's tier is to use it in airplane mode. In our testing, every Tier A app functioned identically offline. Every Tier D app lost AI functionality offline. Tier B and C apps fell in between, with Tier B apps losing only sync capabilities and Tier C apps showing various degradations. If you enable airplane mode and your journal app's AI features stop working, your entries are being processed in the cloud. It takes 30 seconds and requires no technical knowledge.
What This Means for Users
The On-Device AI Maturity Model reveals a market where the vast majority of journal apps treat your most personal data as raw material for cloud processing. But the model also provides a clear framework for making informed choices.
If privacy is non-negotiable, use a Tier A app. There are only two, and DailyVox is the most feature-complete. Zero cloud egress means the privacy guarantee is architectural — it does not depend on trusting a company, reading a privacy policy, or hoping encryption is implemented correctly. The app literally cannot leak your data because it has no mechanism to transmit it.
If you need cross-device sync, Tier B is the best option that still keeps AI on-device. Apple Journal is the most prominent example. Your data transits through iCloud (encrypted), but AI features run locally. This is a reasonable trade-off for users who need their journal on multiple devices and trust Apple's encryption infrastructure.
If you are currently using a Tier C or D app, understand what that means. Your journal entries — potentially years of your most personal thoughts — exist on servers you do not control. If the app uses cloud AI, those entries have been processed by third-party infrastructure. This does not mean you should panic, but it does mean the privacy label "private" in the app's marketing is aspirational at best and misleading at worst.
The airplane mode test should become a habit. Before trusting any app with sensitive data, enable airplane mode and test every feature. If something breaks, your data is leaving your device. This is the simplest, most reliable privacy test available to non-technical users, and it takes less than a minute.
The market is slowly moving toward on-device AI as Apple's frameworks mature and device hardware gets more powerful. But today, the default is still cloud processing. Knowing your app's tier — and choosing accordingly — is the most impactful privacy decision you can make for your digital journal.
Frequently Asked Questions
What is the On-Device AI Maturity Model?
The On-Device AI Maturity Model is a 4-tier framework for classifying journal apps based on where their AI actually processes your data. Tier A means zero cloud egress — all AI runs on-device with no server, no account, and no network calls. Tier D means your journal entries are sent in plaintext to third-party AI APIs like OpenAI or Google. The model was developed after analyzing 40 journal apps using network proxy testing, SDK analysis, and architecture audits.
What percentage of journal apps send entries to cloud AI?
In our analysis of 40 journal apps, 48% send plaintext journal entries to cloud AI APIs for processing. This includes apps that market themselves as "AI-powered" or "smart" journals. The AI features require your entries to leave your device and be processed on external servers, often operated by third parties like OpenAI, Google, or Anthropic. Users typically have no indication this is happening unless they monitor network traffic.
How many journal apps qualify as Tier A?
Only 5% of the 40 apps we analyzed — just 2 apps — qualify as Tier A. Achieving Tier A requires all AI processing to happen on-device with zero data leaving the phone. DailyVox is one of only two apps that met this classification. Most apps fail because they embed analytics SDKs, require accounts, or use cloud APIs for at least some AI features.
Is DailyVox a Tier A app?
Yes. DailyVox is classified as Tier A — Zero Cloud Egress. All AI features including voice transcription (Apple Speech), mood analysis (CoreML), sentiment tracking (NaturalLanguage), and Digital Twin run entirely on-device using Apple's native frameworks. There is no server, no account, no third-party SDK, and no network call made during any journaling or AI operation. It carries Apple's "Data Not Collected" privacy label.
What tier is Apple Journal?
Apple Journal is classified as Tier B — Encrypted Cloud with On-Device AI. Its AI features run on-device, but data syncs through iCloud. Apple controls both the device and the sync layer, and Advanced Data Protection offers end-to-end encryption. However, it requires an Apple ID and data transits through Apple's infrastructure, which prevents it from achieving Tier A.
How can I test which tier my journal app belongs to?
The simplest test is the airplane mode test. Enable airplane mode on your phone and try to use every AI feature in your journal app — transcription, insights, prompts, mood analysis. If everything works normally, the AI runs on-device (Tier A or B). If AI features break but basic journaling works, the app is likely Tier C with cloud AI. If the app barely functions at all, it is Tier D — cloud-dependent for both storage and AI. This test takes under a minute and requires no technical knowledge.
Related Articles
Try a Tier A Journal App
DailyVox: zero cloud egress, all AI on-device, no servers, no accounts. One of only 2 apps classified as Tier A. Free.
Download on the App Store