Reward models need reward-model QA
Reward model QA is the missing layer that turns step-level preference data into trustable training signal. When…
continue reading..
The evaluator uniqueness primitive: from sybil resistance to agent evaluation
Evaluator uniqueness, the property that one person can prove they are one unique evaluator without disclosing identity,…
continue reading..
Continuous training needs continuous evaluators
Longitudinal evaluation is the human-judgement layer that scales alongside continual model adaptation. A continually retrained model paired…
continue reading..
Your reward model is only as good as your preference data
Preference data integrity is the upstream gate that determines what every distilled, fine-tuned, or RLHF-aligned model is…
continue reading..
When benchmarks break: the case for traceable evaluator provenance
Evaluator provenance is the layer that turns benchmark results from “trust the publisher” claims into independently verifiable…
continue reading..
Signed content for a world where platforms are AI
AI-mediated communication systems measurably shift the opinions of the groups they serve, and the question “did this…
continue reading..
Reputation as public infrastructure
The supply of trusted AI evaluators is bottlenecked not by a shortage of humans but by platform-bound…
continue reading..
Why persistent identity is the missing layer under AI evaluation
Model drift in flagship AI systems is often misattributed to changes in the model when it is,…
continue reading..
From ownership to consent, audit and revocation
On the recent Ontology Privacy Hour, buried inside Nick Ris’s response to a question about why people want…
continue reading..
- 1
- 2
