algs: Evaluate DocsChecker on real data #303

@andreygetmanov

Description

21.10
Validate papers ✅
Validate doc ✅

28.10
Validate anti-paper ✅
Validate anti-doc ❌

31.10
Try reasoning models:
OpenAI o3, o4-mini

Status

Done
