You direct. The engine verifies.
Define the domain. Provide the corpus. The engine checks citation support, maps contradictions, and pressures the draft before it reaches a reviewer.
Sources: PubMed, Semantic Scholar, or your own corpus. Evidence verified before delivery. Weak claims rejected early.
How it works
critic_axes:
1. mechanism_plausibility — does the proposed mechanism actually work?
2. evidence_strength — human data > animal > in vitro > theoretical
3. safety_profile — risks, contraindications, failure modes
4. novelty_assessment — genuinely new or already known?
5. testability — can this be validated experimentally?
verdicts: VIABLE | WEAK | REJECTED
typical rejection rate: 50-60% The critic adapts. Pharmacology gets mechanism-of-action attacks. Clinical research gets evidence-hierarchy pressure. Same axes. Domain expertise shifts.
You direct. You judge. The engine verifies between.
Most AI research tools surface papers fast. This tool asks a different question: does your draft survive scrutiny?
The loop repeats until output is either verified or rejected. No hallucination passes. No unsupported claim survives. The researcher controls direction; the engine enforces evidence standards.
1,538 papers. 21 candidate hypotheses. 1 survived The Gate.
A research team ran the complete psychedelic and ketamine antidepressant literature. The verification layer rejected most candidates. One survived with verified support.
the discovery: temporal stacking of mGlu2/3 antagonism after ketamine
Cross-referenced papers rarely read together. The real value: unsupported ideas do not masquerade as conclusions.
All 7 directions, 21 hypotheses
| Direction | Hypotheses | Best Verdict | Key Finding |
|---|---|---|---|
| mGlu2/3 + Ketamine Stacking | 3 | VIABLE | Temporal stacking extends neuroplasticity window |
| 5-HT2A Receptor Modulation | 3 | WEAK | Partial agonist approach needs more specificity data |
| Muscimol + SSRI Interactions | 3 | WEAK | Safety finding: serotonin syndrome risk flagged |
| Neuroplasticity Cascades | 3 | WEAK | BDNF timing windows need dose-response data |
| Default Mode Network | 3 | REJECTED | Mechanism too diffuse for actionable protocol |
| Gut-Brain Axis | 3 | REJECTED | Insufficient mechanistic evidence for psychedelic link |
| Epigenetic Markers | 3 | REJECTED | Too speculative for current evidence base |
It finds risks too.
case study bonus: drug interaction safety signal
The engine flagged a triple interaction: muscimol + trazodone + sertraline creating serotonergic excess. Emerged from combinatorial analysis across 1,538 papers — too diffuse for manual review.
Any domain. The engine catches contradictions and safety signals that fast synthesis flattens away.
Adversarial critique catches failure modes single-model review misses.
3x inference latency. 24-hour turnaround. Requires a well-formed question.
Bring output that needs to survive review.
Any domain with real literature and a real reviewer on the other side.
[ research audit ]