EasyChair Smart Slide
S2n-Bignum-Bench: a Practical Benchmark for Evaluating Low-Level Code Reasoning of LLMs
S2n-Bignum-Bench: a Practical Benchmark for Evaluating Low-Level Code Reasoning of LLMs