对于开源或开放API的模型,可直接提交拉取请求(可以选择同时在src目录下更新测试代码)。 如模型未开放公测,请发送测试代码到[email protected],同时将测试结果更新在榜单,并提交拉取请求。我们会在验证结果的真实性之后更新榜单。 数据 我们根据每个 ...
While a lot of recent research focuses on enhancing the textual reasoning capabilities of Large Language Models (LLMs) by optimizing the multi-agent framework or reasoning chains, several benchmark ...
Understanding and reasoning about code semantics is essential for enhancing code LLMs’abilities to solve real-world software engineering (SE) tasks. Although several code reasoning benchmarks exist, ...
科技媒体marktechpost报道,英伟达近日开源了其Open Code Reasoning(OCR)模型套装,包含32B、14B和7B三种参数规模,均采用Apache 2.0许可证发布,模型权重和配置已在Hugging Face平台开放下载。 OCR模型基于Nemotron架构优化,适用于多语言、多任务学习。其中,32B模型面向高 ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果