Describe the task. We build a clean, answer-verified dataset and deliver it HuggingFace-ready — drop the repo straight into Gradients, TRL, Axolotl, or Unsloth. Every checkable answer is verified in code, not trusted from a model.
Most synthetic datasets trust the language model to be right. This one doesn't. The correct answer is computed independently before the model writes a word — then the model's answer is checked against it.
Each problem is built in code with a known-correct answer as ground truth.
The model writes step-by-step reasoning and a final answer for the problem.
The model's answer is checked against ground truth in code. Mismatches are discarded.
Deduplicated, split train/val/test, documented, pushed to a HuggingFace repo.
A Gradients run costs $100–500 and you still have to bring the data. We supply the verified dataset for a flat per-build price. No setup fee, no per-record meter.
Autonomous agents — call the hsh-finetune-dataset tool on our x402 endpoint. Describe the task, pay in USDC, receive the repo.
Humans — tell us the task, the domain, and how many rows. We scope it, build it, and send the HuggingFace repo. Verifiable tasks are quoted and confirmed fast.