How We Test Apps

Every app we rank is tested by a specialist editor for a minimum of two weeks. Here's exactly how our scoring works.

Our 5-Criteria Scoring System

Each app is scored on a 10-point scale across five criteria. The final score is a weighted average, with weights adjusted for category. A budgeting app, for example, weights accuracy and reliability more heavily than an entertainment app would.

1. UX & Design

Weight: 20%

Ease of use, visual design quality, accessibility, and onboarding experience. We assess how quickly a new user can accomplish core tasks and how the interface holds up under daily use.

Key questions we ask:

→ How long to complete a core task on first launch?
→ Does the interface surface the right information at the right time?
→ Is the app accessible (font sizes, contrast, screen reader support)?

2. Feature Depth

Weight: 30%

Does the app do what it claims? Does it cover edge cases that matter to real users? We compare feature sets against the top three alternatives in each category.

Key questions we ask:

→ Does the core feature work reliably?
→ How does feature breadth compare to alternatives?
→ Are advanced features accessible without confusion?

3. Performance & Reliability

Weight: 25%

Speed, stability, sync reliability, and behavior under adverse conditions. We test on older devices, with poor connectivity, and in edge cases that stress-test the app.

Key questions we ask:

→ Does it crash? How often?
→ How long do actions take vs. alternatives?
→ Does sync work correctly across devices?

4. Value

Weight: 15%

Price relative to feature set and alternatives. We consider free tiers, subscription pricing, one-time purchase options, and what's locked behind paywalls.

Key questions we ask:

→ Is the free tier genuinely useful?
→ Is the price justified by the feature set?
→ Does the pricing model respect the user?

5. Support & Updates

Weight: 10%

Quality of customer support, documentation, and the developer's track record for fixing bugs and releasing meaningful updates.

Key questions we ask:

→ How responsive is support?
→ How often does the app update?
→ Does the developer communicate about changes?

What Scores Mean

Score	Rating	Meaning
9.5 – 10.0	Exceptional	Best-in-class. We recommend this app without reservation.
9.0 – 9.4	Excellent	Outstanding app with minor weaknesses. Strong recommendation.
8.5 – 8.9	Very Good	Solid app that performs well in most scenarios.
8.0 – 8.4	Good	Capable app with notable limitations or better alternatives.
7.0 – 7.9	Decent	Works but has significant weaknesses worth knowing.
Below 7.0	Below Average	We rarely include these. Notable problems.

Our Testing Process

App selection

We identify apps with significant user bases or strong peer recognition, then add them to our testing queue. We do not take requests from developers.

Specialist assignment

Each app goes to the editor who specializes in that category. A general journalist does not review a professional design tool.

Two-week minimum testing

The assigned editor uses the app for at least 14 days in real-world scenarios, not controlled testing environments.

Structured scoring

After testing, the editor scores each criteria against a defined rubric. Scores are reviewed by the Editor-in-Chief.

Comparative check

Before publishing, we verify rankings against the current alternatives to ensure relative scoring is accurate.

Ongoing updates

Rankings are reviewed when apps ship significant updates. A major redesign or new feature set triggers a re-test.

Editorial Independence

Apps Tested does not accept payment for reviews or rankings. No developer can purchase placement in any list.

We do not use affiliate links. Our revenue comes from display advertising, and advertisers have no input on editorial content.

If we receive a free app license for review purposes, we disclose this in the article. It does not affect scores.

All editorial decisions are made by our editors. No external party reviews articles before publication.