Prediction Quality
Pick a domain, a prediction target, and input features
Evaluation configuration
Prediction target
Field whose value Aito predicts on the held-out test set.
Test set size: 100
Capped at 300 to keep latency bounded. Larger = more stable accuracy estimate, slower call.
Input features
These map to { $get: "fieldname" } bindings in the where clause.
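As a rough illustration of how the configuration above could turn into a request, the sketch below builds an `_evaluate` body from a prediction target, a list of input features, and a test set size. The function name `build_evaluate_query`, the table and field names, and the exact body layout (`testSource`, `evaluate`, `predict`) are assumptions made for this example and should be checked against the Aito API reference; only the `{ $get: "fieldname" }` binding and the cap of 300 come from the text above.

```python
import json

MAX_TEST_SIZE = 300  # cap stated in the configuration text above


def build_evaluate_query(table, target, features, test_size=100):
    """Build a request body for an _evaluate call.

    The body layout used here is an assumption for illustration,
    not a confirmed description of the API.
    """
    if not features:
        # Mirrors the UI hint: at least one input field is needed.
        raise ValueError("pick at least one input field")

    # Clamp the requested test set size to the range 1..MAX_TEST_SIZE.
    limit = max(1, min(int(test_size), MAX_TEST_SIZE))

    # Each input feature becomes a {"$get": "<fieldname>"} binding,
    # so the where clause reads the value from the held-out test row.
    where = {name: {"$get": name} for name in features}

    return {
        "testSource": {"from": table, "limit": limit},
        "evaluate": {"from": table, "where": where, "predict": target},
    }


# Hypothetical table and field names, used only for this example.
query = build_evaluate_query(
    "invoices", "gl_code", ["vendor", "amount"], test_size=500
)
print(json.dumps(query, indent=2))
```

Clamping in one place keeps the latency bound independent of whatever value the size control sends.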
Model performance
From _evaluate aggregate
Accuracy
Mean rank (lower = better)
Geom mean p (calibrated confidence)
Gain (vs. baseline)
Test samples
Train samples
Correct
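To make the metric labels above concrete, here is a minimal sketch of how such aggregates could be computed from per-case results. It assumes each case records the 0-based rank of the true value and the probability the model assigned to it, and that gain is the difference between accuracy and a baseline accuracy. These definitions, the `summarize` helper, and the sample numbers are assumptions for illustration; the `_evaluate` endpoint may define its aggregates differently.

```python
import math


def summarize(cases, baseline_accuracy):
    """Aggregate per-case results into the metrics listed above.

    Each case is a dict with:
      "rank": 0-based rank of the true value (0 means the top prediction was right)
      "p":    probability the model assigned to the true value
    """
    n = len(cases)
    if n == 0:
        raise ValueError("no cases to summarize")

    correct = sum(1 for c in cases if c["rank"] == 0)
    accuracy = correct / n
    mean_rank = sum(c["rank"] for c in cases) / n

    # Geometric mean of p, computed in log space; p is floored
    # at a tiny value so that a zero probability does not break log().
    log_sum = sum(math.log(max(c["p"], 1e-12)) for c in cases)
    geom_mean_p = math.exp(log_sum / n)

    return {
        "accuracy": accuracy,
        "mean_rank": mean_rank,
        "geom_mean_p": geom_mean_p,
        "gain": accuracy - baseline_accuracy,
        "test_samples": n,
        "correct": correct,
    }


# Made-up cases, for illustration only.
cases = [
    {"rank": 0, "p": 0.8},
    {"rank": 0, "p": 0.5},
    {"rank": 2, "p": 0.1},
    {"rank": 1, "p": 0.25},
]
summary = summarize(cases, baseline_accuracy=0.25)
print(summary)
```

The geometric mean penalizes confident misses more heavily than an arithmetic mean would, which is why it is a useful check on how well calibrated the reported probabilities are.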
Evaluation cases
First 0 of 0 · green = correct, red = wrong
No cases returned. Pick at least one input field.