Linux大棚 – 不忘初心的技术博客,浮躁时代的安静角落
  •  首页
  •  技术日记
  •  编程
  •  旅游
  •  数码
  •  登录
  1. 标签
  2. Evaluating
  • A COMPREHENSIVE SURVEY ON EVALUATING LARGE LANGUAGE MODEL APPLICATIONS IN THE MEDICAL INDUSTRY

    本文是LLM系列文章,针对《A COMPREHENSIVE SURVEY ON EVALUATING LARGE LANGUAGE MODEL APPLICATIONS IN THE MEDICAL INDUSTRY》的翻译。关于评估医
    Evaluating Large Comprehensive Survey Language
    admin 6月前
    104 0
  • Evaluating Large Language Models: A Comprehensive Survey

    本文是LLM系列文章,针对《Evaluating Large Language Models: A Comprehensive Survey》的翻译。评估大型语言模型:一项综合调查 摘要 1 引言 2 分类和路线图 3 知识和能力评估
    Language Large Evaluating Survey Comprehensive
    admin 6月前
    88 0
  • MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese

    本文是LLM系列文章,针对《MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical L
    Standardized Reliable MedBench Comprehensive Evaluating
    admin 6月前
    106 0
CopyRight © 2022 All Rights Reserved 豫ICP备2021025688号-21
Processed: 0.022 , SQL: 9