"We curate a subset of challenging problems that remain unsolved by the 7B prover model in an end-to-end manner, but for which all decomposed sub-goals have been successfully resolved. By composing ...