Rubric v1.0

How we score AI devices

Every device on The New Hardware is rated against the same ten criteria. Each criterion is scored from 0 to 10 and contributes to the final AI Score in proportion to its weight. Weights sum to 100, so the AI Score also falls on a 0 to 10 scale.

AI Score = round(Σ (score × weight) ÷ 100, 1)
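As an illustration of the formula, here is a minimal Python sketch that computes an AI Score from ten criterion scores. The weights are the ones listed for each criterion in this rubric; the function name `ai_score`, the `WEIGHTS` dictionary, and the example scores are assumptions made for this illustration and do not describe a real device or our internal tooling.

```python
# Weights as listed for the ten criteria in this rubric.
WEIGHTS = {
    "LLM Quality": 15,
    "Voice Understanding": 12,
    "Vision Capability": 10,
    "Memory & Context": 10,
    "Autonomy": 10,
    "Privacy & Data Handling": 10,
    "Battery Life": 8,
    "Price / Value": 12,
    "Ecosystem & Updates": 8,
    "Repairability & Sustainability": 5,
}


def ai_score(scores, weights=WEIGHTS):
    """Return the weighted AI Score, rounded to one decimal place."""
    if sum(weights.values()) != 100:
        raise ValueError("weights must sum to 100")
    if scores.keys() != weights.keys():
        raise ValueError("scores must cover exactly the rubric criteria")
    for name, score in scores.items():
        if not 0 <= score <= 10:
            raise ValueError(f"{name}: score must be between 0 and 10")
    # Sum of score x weight over all criteria, divided by 100.
    total = sum(scores[name] * weights[name] for name in weights)
    return round(total / 100, 1)


# Hypothetical scores for an imaginary device (not a real review).
example_scores = {
    "LLM Quality": 8,
    "Voice Understanding": 8,
    "Vision Capability": 5,
    "Memory & Context": 6,
    "Autonomy": 4,
    "Privacy & Data Handling": 7,
    "Battery Life": 6,
    "Price / Value": 5,
    "Ecosystem & Updates": 6,
    "Repairability & Sustainability": 3,
}

print(ai_score(example_scores))  # weighted sum is 607, so this prints 6.1
```

Python's built-in `round` resolves exact ties toward the even digit and works on binary floating-point values, so a weighted sum that lands exactly on a half step (for example 5.95) may not round the way a reader expects. The rubric does not state a tie-breaking rule, so treat that behaviour as a property of this sketch only.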

01. LLM Quality

weight 15

How capable is the underlying language model for reasoning, follow-ups, and accuracy?

0 / 10
Scripted responses, no real LLM, frequent nonsense.
5 / 10
GPT-3.5-class behaviour: often useful, but prone to hallucination.
10 / 10
Frontier-model behaviour: nuanced reasoning, tool-use, low hallucination.

02. Voice Understanding

weight 12

Speech recognition quality across accents, noise, and natural speech.

0 / 10
Only works in quiet rooms with American English.
5 / 10
Solid on clean audio; accuracy drops in noise or with accents.
10 / 10
Near-human ASR across accents, languages, and noisy environments.

03. Vision Capability

weight 10

Camera + vision model: object recognition, OCR, scene description.

0 / 10
No camera, or camera is decorative.
5 / 10
Useful for identifying common objects and reading clear text.
10 / 10
Real-time scene understanding, OCR in poor lighting, multimodal Q&A.

04. Memory & Context

weight 10

Does the device remember past conversations, build a useful user profile, and recall relevant context?

0 / 10
Stateless — every session starts from zero.
5 / 10
Short-term memory within a session; manual recall.
10 / 10
Long-term, queryable memory with transparent edit controls.

05. Autonomy

weight 10

Can the device take real-world actions — book, send, buy, plan — on the user's behalf?

0 / 10
Read-only assistant; suggestions only.
5 / 10
A few hardcoded integrations (timer, calendar).
10 / 10
True agent behaviour with reliable multi-step task execution.

06. Privacy & Data Handling

weight 10

Transparency about what is recorded, where data is stored, and user control over deletion.

0 / 10
Opaque data flow, no opt-outs, foreign-jurisdiction storage.
5 / 10
Standard cloud LLM with documented retention and basic deletion.
10 / 10
On-device processing where possible, end-to-end encryption, full deletion and export controls.

07. Battery Life

weight 8

Practical run time for the device's intended use pattern.

0 / 10
Cannot last a typical session of use.
5 / 10
Lasts most of a workday with moderate use.
10 / 10
Multi-day battery in real-world use, fast top-ups.

08. Price / Value

weight 12

Is the device's price justified by what it actually does today?

0 / 10
Egregiously overpriced for the delivered capability.
5 / 10
Fair price for what it does, comparable to alternatives.
10 / 10
Outperforms its price tier; clear value for money.

09. Ecosystem & Updates

weight 8

Companion apps, third-party integrations, and pace of meaningful software updates.

0 / 10
Closed, no updates, abandoned roadmap.
5 / 10
Modest first-party app, occasional updates.
10 / 10
Active SDK, integrations, monthly meaningful updates.

10. Repairability & Sustainability

weight 5

Repair score, modular battery, recycled materials, and end-of-life support.

0 / 10
Glued shut, no spare parts, e-waste in 18 months.
5 / 10
Some user-serviceable parts, no formal sustainability program.
10 / 10
Modular, repairable, recycled materials, take-back program.

Editorial standards

  • We disclose every affiliate relationship on every page that contains one.
  • Scores never depend on whether a brand pays us. Sponsored listings are clearly labelled.
  • We track and publish device discontinuations — including ones we previously recommended.
  • Rubric weights are reviewed twice a year. Past scores are recomputed when weights change.