Sonnet 3.5 Achieves Impressive 49% Score on SWE-bench Verified
Anthropic has recently presented its Sonnet 3.5 model, which has achieved a remarkable score of 49% on the SWE-bench Verified benchmark. This score marks a significant leap from its predecessor’s performance of 33.4%. The announcement not only positions Sonnet 3.5 as a formidable contender among public models but also illustrates Anthropic’s commitment to advancing artificial […]
Sonnet 3.5 Achieves Impressive 49% Score on SWE-bench Verified Read More »