How We Test Apps
Every app we rank is tested by a specialist editor for a minimum of two weeks. Here's exactly how our scoring works.
Our 5-Criteria Scoring System
Each app is scored on a 10-point scale across five criteria. The final score is a weighted average, with weights adjusted for category. A budgeting app, for example, weights accuracy and reliability more heavily than an entertainment app would.
1. UX & Design
Weight: 20%Ease of use, visual design quality, accessibility, and onboarding experience. We assess how quickly a new user can accomplish core tasks and how the interface holds up under daily use.
Key questions we ask:
- → How long to complete a core task on first launch?
- → Does the interface surface the right information at the right time?
- → Is the app accessible (font sizes, contrast, screen reader support)?
2. Feature Depth
Weight: 30%Does the app do what it claims? Does it cover edge cases that matter to real users? We compare feature sets against the top three alternatives in each category.
Key questions we ask:
- → Does the core feature work reliably?
- → How does feature breadth compare to alternatives?
- → Are advanced features accessible without confusion?
3. Performance & Reliability
Weight: 25%Speed, stability, sync reliability, and behavior under adverse conditions. We test on older devices, with poor connectivity, and in edge cases that stress-test the app.
Key questions we ask:
- → Does it crash? How often?
- → How long do actions take vs. alternatives?
- → Does sync work correctly across devices?
4. Value
Weight: 15%Price relative to feature set and alternatives. We consider free tiers, subscription pricing, one-time purchase options, and what's locked behind paywalls.
Key questions we ask:
- → Is the free tier genuinely useful?
- → Is the price justified by the feature set?
- → Does the pricing model respect the user?
5. Support & Updates
Weight: 10%Quality of customer support, documentation, and the developer's track record for fixing bugs and releasing meaningful updates.
Key questions we ask:
- → How responsive is support?
- → How often does the app update?
- → Does the developer communicate about changes?
What Scores Mean
| Score | Rating | Meaning |
|---|---|---|
| 9.5 – 10.0 | Exceptional | Best-in-class. We recommend this app without reservation. |
| 9.0 – 9.4 | Excellent | Outstanding app with minor weaknesses. Strong recommendation. |
| 8.5 – 8.9 | Very Good | Solid app that performs well in most scenarios. |
| 8.0 – 8.4 | Good | Capable app with notable limitations or better alternatives. |
| 7.0 – 7.9 | Decent | Works but has significant weaknesses worth knowing. |
| Below 7.0 | Below Average | We rarely include these. Notable problems. |
Our Testing Process
App selection
We identify apps with significant user bases or strong peer recognition, then add them to our testing queue. We do not take requests from developers.
Specialist assignment
Each app goes to the editor who specializes in that category. A general journalist does not review a professional design tool.
Two-week minimum testing
The assigned editor uses the app for at least 14 days in real-world scenarios, not controlled testing environments.
Structured scoring
After testing, the editor scores each criteria against a defined rubric. Scores are reviewed by the Editor-in-Chief.
Comparative check
Before publishing, we verify rankings against the current alternatives to ensure relative scoring is accurate.
Ongoing updates
Rankings are reviewed when apps ship significant updates. A major redesign or new feature set triggers a re-test.
Editorial Independence
Apps Tested does not accept payment for reviews or rankings. No developer can purchase placement in any list.
We do not use affiliate links. Our revenue comes from display advertising, and advertisers have no input on editorial content.
If we receive a free app license for review purposes, we disclose this in the article. It does not affect scores.
All editorial decisions are made by our editors. No external party reviews articles before publication.