2,034 bug-revealing cases across three widely-used DL compilers. That's what XCheck found, and it didn't get there by fuzzing random models.
Why Existing Testing Misses the Real Bugs
Most DL compiler testing tools constrain input generation with type rules, then watch for crashes or coverage gains. They overlook the bugs that bite you when you actually deploy: compiler-platform interaction bugs that arise from interleaved effects across compilation passes and execution environments. TVM and ONNX-MLIR both suffer from this blind spot.
Full-Stack Constraints and Assertions
XCheck's key insight: these bugs stem from violated assumptions that span compilation passes and hardware platforms. Instead of constraining just input generation, XCheck extracts full-stack constraints that jointly guide model generation and characterize compilation behaviors. It prioritizes constraints that expose interaction-sensitive behaviors, so generated models exercise deep compilation logic. Then it automatically inserts assertions to monitor symptoms that coverage or pass/fail signals never catch.
The Tally: Memory Overflows, Integer Overflows, Silent Failures
XCheck found 2,034 bug-revealing cases across three compilers. That includes memory overflows, integer overflows, and silent unexpected compilations - cases where the compiler produced wrong output without crashing. Every bug was rooted in compiler-platform interactions, not type mismatches or simple codegen errors.
I've spent years annoyed by testing tools that claim to find deep bugs but only surface crash dumps. XCheck's results suggest the real problem wasn't missing test coverage - it was missing the cross-layer constraints that define correct behavior.
If you're shipping models through TVM or ONNX-MLIR, XCheck's approach gives you a concrete reason to start hunting for similar latent bugs in your own pipeline.
Source: Finding Compiler-Platform Interaction Bugs in Deep Learning Pipelines via Cross-Layer Constraints
Domain: arxiv.org
Comments load interactively on the live page.