0$,$P(X_1^2+X_2^2+X_3^2+X_4^2\\le kX_5^2)=\\alpha$则k=____\n", + "A. $\\frac{1}{4}F_{\\alpha}(4,1)$\n", + "B. $\\frac{1}{4}F_{1-\\alpha}(4,1)$\n", + "C. $4F_{\\alpha}(4,1)$\n", + "D. $4F_{1-\\alpha}(4,1)$\n", + "答案是什么? \n", + "response: 根据\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 15=======\n", + " 89% 16/18 [00:01<00:00, 15.69it/s]\n", + "=======begin 16=======\n", + "question: 设$X_1,X_1,\\cdots X_8$为来自总体$X\\sim N\\left(\\mu_1,1\\right)$的简单样本,$\\bar{X},S_1^2$分別是其对应的样本均值与样本方差。$Y_1,Y_1,\\cdots,Y_7$为来自总$Y\\sim N\\left(\\mu_2,1\\right)$的简单样本,$\\bar{Y},S_2^2$分别是其对应的样本均值与样本方差。下列选项正确的是:____\n", + "A. $\\sum_{i=1}^8\\left(X_i-\\mu_1\\right)^2+\\sum_{i=1}^7\\left(Y_i-\\mu_2\\right)^2 \\sim \\chi^2(15)$\n", + "B. $E\\left(\\sum_{i=1}^8\\left(X_i-\\mu_1\\right)^2+\\sum_{i=1}^7\\left(Y_i-\\mu_2\\right)^2\\right)=15$\n", + "C. $\\mathrm{D}(\\bar{X}+\\bar{Y})=\\frac{1}{8}+\\frac{1}{7}$\n", + "D. $\\bar{X}-\\bar{Y} \\sim \\mathrm{N}\\left(\\mu_1-\\mu_2, \\frac{1}{8}+\\frac{1}{7}\\right)$\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 若随机变量X的分布函数为$F(x)=pF_1(x)+qF_2(x)$,其中$F_1(x)$,$F_2(x)$为两个分布函数,常数p,q满足:$p>0$,$q>0$,$p+q=1$,那么X的分布叫作$F_1(x),F_2(x)$的混合分布.设$\\mu_1,\\mu_2$分别为$F_1(x),F_2(x)$的期望,$\\sigma_1^2,\\sigma_2^2$分别为$F_1(\\mathrm{x})$,$F_2(\\mathrm{x})$的方差,则$DX=$____\n", + "A. $p \\sigma_1^2+q \\sigma_2^2$\n", + "B. $p^2 \\sigma_1^2+q^2 \\sigma_2^2$\n", + "C. $p \\sigma_1^2+q \\sigma_2^2+p q\\left(\\mu_1-\\mu_2\\right)^2$\n", + "D. $p \\sigma_1^2+q \\sigma_2^2+p q\\left(\\sigma_1-\\sigma_2\\right)^2$\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: C \n", + "\n", + "=======end 17=======\n", + "100% 18/18 [00:01<00:00, 15.90it/s]\n", + "Subject: probability_and_statistics\n", + "Acc: 22.22222222222222\n", + "0.9615384615384616 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of high_school_chinese!\n", + " 0% 0/19 [00:00, ?it/s]\n", + "=======begin 0=======\n", + "question: 下文划线处选填哪项最恰切____\n", + "作物同病菌进行斗争,情形是复杂的:____,就是同一个抗病品种,对不同的病菌的抵抗方式也不一样。\n", + "A. 不同的抗病品种抵抗病菌的方式不仅有所不同\n", + "B. 不同的抗病品种不仅抵抗病菌的方式有所不同\n", + "C. 不仅不同的抗病品种抵抗病菌的方式有所不同\n", + "D. 固然不同的抗病品种抵抗病菌的方式有所不同\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: C \n", + "\n", + "=======end 0=======\n", + "\n", + "=======begin 1=======\n", + "question: 下列各句中,没有语病的一句是____\n", + "A. 某些吃惯“大锅饭”的职工对劳动人为制度的革新,切实其实会感到不适应。\n", + "B. “全面建设小康社会”的目标,对于我们感到十分亲热;它已经成为全党天下人民在新世纪中奋斗的行动纲领。\n", + "C. 日本辅弼前去“靖国神社”为东条英机等战争罪犯招魂的反动行径,对于曾饱受侵略战争祸害的中国人民和其他亚洲国家的人民是不克不及容忍的。\n", + "D. 世界重量级拳击冠军易斯接受了女皇颁发的皇家勋章,以表彰他为英国拳击事业做出的贡献。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 1=======\n", + " 11% 2/19 [00:00<00:01, 15.79it/s]\n", + "=======begin 2=======\n", + "question: 下列各句中,没有语病的一句是____\n", + "A. 在对WTO问题的关注上,过去主要集中在行业、企业等方面所面临的压力上,多是从微观层面考虑问题,而对于经济体制等宏观问题却思考甚少。\n", + "B. 对在如何使学生掌握现代化生活所必须的知识技能的问题上,该校的老师作过深入详尽的研究。\n", + "C. 著名词曲作家付林创作《妈妈的吻》《小螺号》《故园之恋》等脍炙生齿的歌曲而蜚声乐坛。\n", + "D. 载人航天技术,是我国高新科技水平显著提高的重要标志,也是我国综合国力显著提高的重要体现。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 2=======\n", + "\n", + "=======begin 3=======\n", + "question: 下列各句中,没有语病的一句是____\n", + "A. 记者从新闻发布会上获悉,10月26日,辽宁省锦州市黑山县出现禽流感疫情已得到有效控制。\n", + "B. 王越洲和姚佳琪赶赴航天城,他们将从航天员的手中接过搭乘“神舟六号”进行太空之旅的自己的画作,并得到纪念证书。\n", + "C. 不管《泰晤士报》这个排行榜的权威程度颇受国人质疑,但据专家称,排行榜是能够说明一些问题的。\n", + "D. 进入乌镇,信步于幽深的街巷中,你就会觉得自己好像浏览着一部关于江南水乡文化的线装书。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 3=======\n", + " 21% 4/19 [00:00<00:00, 15.56it/s]\n", + "=======begin 4=======\n", + "question: 列字注音全对的一项是\t____\n", + "A. 复杂(fù)\t按捺(nài)\t混淆(xiáo)\t笔画纤细(qiān)\n", + "B. 弥补(mí)\t蓓蕾(bèi)\t发酵(jiào)\t不着边际(zhuó)\n", + "C. 拂晓(fó)\t质量(zhǐ)\t高档(dàng)\t大腹便便(pián)\n", + "D. 勒索(lē)\t结束(sù)\t喧嚣(xiāo)\t酗酒滋事(xù)\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 4=======\n", + "\n", + "=======begin 5=======\n", + "question: 下列各句中,没有语病的一句是____\n", + "A. 半期考试之后,因为她这样好的成绩,获得了老师和同学们的颂扬。\n", + "B. 全校师生在雷锋精神的鼓舞下,好人好事,如雨后春笋似的涌现出来。\n", + "C. 他们襟怀胸襟祖国,放眼天下,在高手如林的雅典奥运会上,大力发扬了敢拼敢搏,终于夺得了冠军。\n", + "D. 这个节目表达了同学们要以实际行动向雷锋同志学习,以优异的成绩向党报告的决心。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 5=======\n", + " 32% 6/19 [00:00<00:00, 15.67it/s]\n", + "=======begin 6=======\n", + "question: 下列各句标点符号使用合乎规范的一项是____\n", + "A. 对李清照的诗,比之那“寻寻觅觅,冷冷清清,凄凄惨惨戚戚”的哀怨,我倒更喜欢她的“生当作人杰,死亦为鬼雄”的刚烈。\n", + "B. 昨日,武汉工业学院三名学生宣布:他们经过连续奋战,已经找到了三种简便快速检测奶粉中是否含有三聚氰胺的办法,可见普通市民也可以自己动手检测奶粉中有无三聚氰胺。\n", + "C. 为给地铁2号线和4号线让路,武汉市最大的广场——洪山广场将被拆除重建的消息传出后,许多人都非常关心未来的广场将怎么建?那里的几百株树木将怎么办?\n", + "D. “绿动未来2008”环保方案评选活动开展以来,大赛组委会征集到高质量参赛方案367份,内容涉及新能源、新材料的开发与利用、发展绿色经济、环境保护和生态治理新技术等诸多方面。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 6=======\n", + "\n", + "=======begin 7=======\n", + "question: 下列各句中,没有语病、句意明确的一项是____\n", + "A. 近年来骑马爱好者剧增,使得赛马运动发展迅速,相应的,一些骑马俱乐部也应运而生。\n", + "B. 他饰演了一个英雄人物,观众被深深打动了,说这是我们的偶像。\n", + "C. 在引进竞争机制的情况下,如果还想捧着“铁饭碗”不放,那就是一厢情愿。\n", + "D. 艺术教育无论在德育、智育,在人格的完善、性情的陶冶等方面都是教育行为中的一个重要组成部分。\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: C \n", + "\n", + "=======end 7=======\n", + " 42% 8/19 [00:00<00:00, 15.68it/s]\n", + "=======begin 8=======\n", + "question: 下文横线处选填哪项最恰当____\n", + "卢梅坡的诗句“梅须逊雪三分白,雪却输梅-段香”,常被人引用,借此说明____。\n", + "A. 任何人和事物都各有缺憾\n", + "B. 任何人和事物都各有千秋\n", + "C. 任何人和事物都各有短长\n", + "D. 任何人和事物者咯有优势\n", + "答案是什么? \n", + "response: B\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 8=======\n", + "\n", + "=======begin 9=======\n", + "question: 下列词语的注音有错误的一项是____\n", + "A. 思量(liáng)\t度量(liàng)\t胸脯(pú)\t果脯(fǔ)\n", + "B. 颤(zhàn)抖\t颤(chàn)栗\t靓(jìng)妆\t靓(liàng)女\n", + "C. 阽(diàn)危\t玷(diàn)辱\t胡诌(zhōu)\t谄(chǎn)谀\n", + "D. 瞋目(chēn)\t瞠(chēng)目结舌 觊觎(yú)\t面面相觑(qù)\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 9=======\n", + " 53% 10/19 [00:00<00:00, 15.67it/s]\n", + "=======begin 10=======\n", + "question: 下列各句中,没有语病的一句是____\n", + "A. 这届体育节会徽和吉祥物设计的应征者大多以青年体育爱好者为主。\n", + "B. 这届“挑战杯”竞赛的参赛高校数量和作品质量,都有了明显提高。\n", + "C. 师傅让位于徒弟,从一个侧面反映了人们已不再惟师是尊,而是开始强调多方面的能力与素养。\n", + "D. 以生产内衣为主的三枪集团,是今年在全国同行业中产值率先突破十亿大关的一个著名品牌。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 10=======\n", + "\n", + "=======begin 11=======\n", + "question: 下文划线处选填哪项才恰当____\n", + "翌日,贾母带着贾蓉媳妇乘坐一乘驮轿,王夫人在后,亦乘坐一乘驮轿;贾珍骑马,率领众家丁围护;____,并放些随换的衣包等件。\n", + "A. 婆子丫环等乘坐几辆大车\n", + "B. 又有几辆大车,婆子丫环等坐\n", + "C. 又有几辆大车,与婆子丫环等坐\n", + "D. 几辆大车,婆子丫环等坐\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: C \n", + "\n", + "=======end 11=======\n", + " 63% 12/19 [00:00<00:00, 15.83it/s]\n", + "=======begin 12=======\n", + "question: 下列词语中注音全都正确的一项是____\n", + "A. 巨擘(bò) 蓓蕾(lěi) 前倨后恭(jū)\n", + "B. 中伤(zhōnɡ) 莅临(lì) 鞭辟入里(bì)\n", + "C. 曲解(qū) 骁勇(xiāo) 余勇可贾(ɡǔ)\n", + "D. 蜚声(fēi) 阜盛(fù) 量体裁衣(liánɡ)\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 12=======\n", + "\n", + "=======begin 13=======\n", + "question: 下列各句中加下划线的成语使用恰当的一句是:____\n", + "A. 你的这个$\\underline{不情之请}$让我很为难,过两天我再答复你吧。\n", + "B. 对于学到的原理,他都要拿实物来做实验,求得彻底了解,决不$\\underline{囫囵吞枣}$,马虎了事。\n", + "C. 峨眉山是闻名中外的旅游胜地,素有“峨眉天下秀”之誉其巍峨磅礴,重峦叠嶂,山山有奇景,十里不同天,真是$\\underline{巧夺天工}$。\n", + "D. 在学习上也是这样,吃别人嚼过的馍不香,要善于动脑筋,$\\underline{师心自用}$,才能学深学透。\n", + "答案是什么? \n", + "response: A\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 13=======\n", + " 74% 14/19 [00:00<00:00, 15.94it/s]\n", + "=======begin 14=======\n", + "question: 下列各句中,没有语病的一项是____\n", + "A. 以“伟大历程辉煌成就”为主题的纪念新中国成立70周年展览在北京拉开帷幕,该展览采用编年体的形式为主全方位回顾了中国人民走过的辉煌历程。\n", + "B. 经过主创团队对经典故事的大胆改编,《哪吒》不仅保留了原作精华,还融入了具有时代元素的内容,因此成功斩获暑期电影最佳口碑。\n", + "C. 网络谣言对社会的破坏力是巨大的,如不及时扑灭,对公众造成的创伤,乃至引起社会动荡,也不是完全不可能的。\n", + "D. 垃圾分类工作能否执行到位,一方面取决于政府相关法律法规的约束力,另一方面也取决于市民的环保意识,尤其是对垃圾分类意义的认识。\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 14=======\n", + "\n", + "=======begin 15=======\n", + "question: 填入下文划线处恰当的一句是____\n", + "自从“五四”以来,翻译介绍先进国家的文化成果,就成了中国人民的迫切要求。____。\n", + "A. 这些翻译作品促进了中国学术文化的发展,同时也影响了中国的书面语言\n", + "B. 翻译作品日渐其多,一方面这些作品提高了中国学术文化的素养,另一方面也促进了中国书面语言的发展\n", + "C. 翻译作品日见其多,这些作品促进了中国学术文化的发展,同时也影响了中国的书面语言\n", + "D. 这些翻译作品提高了中国学术文化的素养,同时也促进了中国书面语言的发展\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: C \n", + "\n", + "=======end 15=======\n", + " 84% 16/19 [00:01<00:00, 15.89it/s]\n", + "=======begin 16=======\n", + "question: 填入下面横线处的句子,与上下文衔接最恰当的一句是____\n", + "《毛诗序》是先秦儒家诗论的总结,其中心内容是阐述诗歌与封建政教的关系。____。“正得失,动天地,感鬼神,莫近于诗。先王以是经夫妇,成孝敬,厚人伦,美教化,移风俗。”因为诗歌具有感染的力量,所以是封建统治者用以维护政教的有力工具。\n", + "A. 久它认为诗歌不仅是社会治乱、政教得失的反映,而且反过来可以维护封建统治和封建秩序\n", + "B. 它认为不仅诗歌是政教得失、社会治乱的反映,而且反过来可以维护封建统治和封建秩序\n", + "C. 它认为诗歌不但能维护封建统治和封建秩序,而且能反映社会治乱、民生苦乐\n", + "D. 它认为由于诗歌具有强大的艺术感染力,故而封建统治者都要用它来维护封建统治和秩序\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 填入下面横线处的句子,与上句衔接最恰当的一组是____\n", + "公安干警及时赶赴现场侦察,中午12时,____。\n", + "A. 在家里犯罪嫌疑人被抓获,全部赃物和赃款也同时起获\n", + "B. 在犯罪嫌疑人家里将其抓获,全部赃物和赃款也同时起获\n", + "C. 犯罪嫌疑人在家里被抓获,并起获了全部赃物和赃款\n", + "D. 在犯罪嫌疑人家里将其抓获,并起获了全部赃物和赃款\n", + "答案是什么? \n", + "response: A\n", + "ans: B\n", + "ground truth: D \n", + "\n", + "=======end 17=======\n", + " 95% 18/19 [00:01<00:00, 15.77it/s]\n", + "=======begin 18=======\n", + "question: 下列词语中注音全都正确的一项是____\n", + "A. 接洽(qià) 掮客(qián) 悭吝(jiàn) 地壳(qiào)\n", + "B. 刚劲(jìn) 舐犊(shì) 龋齿(qǔ) 租赁(lìn)\n", + "C. 畏葸(sī) 怆然(chuànɡ) 皈依(ɡuī) 干涸(hé)\n", + "D. 复辟(bì) 巷道(hànɡ) 炽热(chì) 眼睑(jiǎn)\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 18=======\n", + "100% 19/19 [00:01<00:00, 15.76it/s]\n", + "Subject: high_school_chinese\n", + "Acc: 31.57894736842105\n", + "0.9807692307692307 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of middle_school_physics!\n", + " 0% 0/19 [00:00, ?it/s]\n", + "=======begin 0=======\n", + "question: 在全国中小学安全教育平台中,安全用电常识是其中一项重要的教育内容。下列做法符合安全用电要求的是____\n", + "A. 用铜丝替代保险丝\n", + "B. 更换灯泡时断开电源开关\n", + "C. 开关接在灯泡和零线之间\n", + "D. 使用测电笔时手接触笔尖金属体\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: B \n", + "\n", + "=======end 0=======\n", + "\n", + "=======begin 1=======\n", + "question: 四冲程柴油机在工作过程中,将内能转化为机械能的冲程是____\n", + "A. 吸气冲程\n", + "B. 压缩冲程\n", + "C. 做功冲程\n", + "D. 排气冲程\n", + "答案是什么? \n", + "response: A\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 1=======\n", + " 11% 2/19 [00:00<00:01, 16.29it/s]\n", + "=======begin 2=======\n", + "question: 歌词“小小竹排江中游,巍巍青山两岸走”,前句描述的运动物体和后一句的参照物分别是____\n", + "A. 青山 青山\n", + "B. 竹排 青山\n", + "C. 竹排 竹排\n", + "D. 青山 竹排\n", + "答案是什么? \n", + "response: 这\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 2=======\n", + "\n", + "=======begin 3=======\n", + "question: 头球(运动员用头碰撞飞行中的足球)是足球比赛中常用的技术,下列说法正确的是____\n", + "A. 头球过程中,头对足球的力改变了足球的运动状态\n", + "B. 足球被顶飞,是因为头对足球的力大于足球对头的力\n", + "C. 头对足球的作用力消失时,足球的惯性也消失\n", + "D. 足球在空中飞行时,以运动员为参照物,足球是静止的\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 3=======\n", + " 21% 4/19 [00:00<00:00, 16.40it/s]\n", + "=======begin 4=======\n", + "question: 自行车的各个部分中,减小了有害摩擦的是____\n", + "A. 车胎\n", + "B. 车把\n", + "C. 车轴\n", + "D. 脚踏板面\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 4=======\n", + "\n", + "=======begin 5=======\n", + "question: 下列实例中关于压强和摩擦力的说法正确的是____\n", + "A. 轴承中装有滚珠是为了增大摩擦\n", + "B. 磁悬浮列车悬浮行驶是为了增大摩擦\n", + "C. 鸟的嘴很尖细,在凿树时可以减小压强\n", + "D. 月球车装有很多轮子是为了减小压强\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 5=======\n", + " 32% 6/19 [00:00<00:00, 16.20it/s]\n", + "=======begin 6=======\n", + "question: 对于静止在水平轨道上的“复兴号”列车,下列分析中正确的是____\n", + "A. 列车所受重力和列车对铁轨的压力是一对相互作用力\n", + "B. 列军所受重力和铁轨对列车的支持力是一对相互作用力\n", + "C. 列车所受重力和铁轨对列车的支持力是一对平衡力\n", + "D. 列车对铁轨的压力和铁轨对列车的支持力是一对平衡力\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: C \n", + "\n", + "=======end 6=======\n", + "\n", + "=======begin 7=======\n", + "question: “万物生长靠太阳”,绿色植物的生长需要阳光。物理学研究表明,不透明物体的颜色是由它反射的色光决定的,由此可以推测,不利于绿色植物生长的光是____\n", + "A. 红光\n", + "B. 黄光\n", + "C. 绿光\n", + "D. 紫光\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 7=======\n", + " 42% 8/19 [00:00<00:00, 16.14it/s]\n", + "=======begin 8=======\n", + "question: 下列过程,属于内能转化为机械能的是____\n", + "A. 从滑梯上滑下时臀部发热\n", + "B. 电热水壶烧水\n", + "C. 内燃机的做功冲程\n", + "D. 向下压活塞引燃棉花\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: C \n", + "\n", + "=======end 8=======\n", + "\n", + "=======begin 9=======\n", + "question: 能解释“倒影”形成的是____\n", + "A. 光的色散\n", + "B. 光的折射\n", + "C. 光的反射\n", + "D. 光的直线传播\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: C \n", + "\n", + "=======end 9=======\n", + " 53% 10/19 [00:00<00:00, 16.20it/s]\n", + "=======begin 10=======\n", + "question: 中国国家航天局宣布,2018年5月21日5时28分,我国在西昌卫星发射中心用“长征四号丙”运载火箭,成功将“鹊桥号”中继星发射升空,为“嫦娥四号”月球探测任务提供地月间的中继通信,负责地球与未来“嫦娥四号”通信的中继接力。下列说法正确的是____\n", + "A. 中继星与地月间不可能靠电磁波通信\n", + "B. 地球和太阳系中的其他行星起源于不同的星云\n", + "C. 发射当天地球运行在绕太阳公转轨道中的夏至与秋分之间\n", + "D. 月球的自转周期和公转周期相同,人类只能看到月球的正面\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: D \n", + "\n", + "=======end 10=======\n", + "\n", + "=======begin 11=======\n", + "question: 生活处处有物理,以下估测最接近生活实际的是____\n", + "A. 宿迁六月份平均气温约为10 °C\n", + "B. 初中生背负沉重的书包上学,书包平均重300 N\n", + "C. 初中生课桌高度约为75 cm\n", + "D. 中考体育考试中某同学50 m短跑成绩约为4 s\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: C \n", + "\n", + "=======end 11=======\n", + " 63% 12/19 [00:00<00:00, 16.19it/s]\n", + "=======begin 12=======\n", + "question: 下列实例中,属于增大摩擦的是____\n", + "A. 往自行车轴承中加润滑油\n", + "B. 行李箱下安装滚动轮子\n", + "C. 骑自行车刹车时用力捏闸\n", + "D. 将滑梯的滑道做得光滑\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 12=======\n", + "\n", + "=======begin 13=======\n", + "question: 下列做法中,符合安全用电原则的是____\n", + "A. 高压线下钓鱼\n", + "B. 机壳没有接地\n", + "C. 绝缘皮破损\n", + "D. 安装避雷针\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 13=======\n", + " 74% 14/19 [00:00<00:00, 16.25it/s]\n", + "=======begin 14=======\n", + "question: 超导现象是指某些物质在温度很低时电阻变为零的现象。如果某种超导材料能应用于实际,最适合用来制作____\n", + "A. 保险丝\n", + "B. 输电导线\n", + "C. 电炉丝\n", + "D. 变阻器的电阻丝\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 14=======\n", + "\n", + "=======begin 15=======\n", + "question: 棒球比赛时,向斜上方击球时的情景中,下列有关说法正确的是____\n", + "A. 击球的一瞬间,棒对球的力大于球对棒的力\n", + "B. 球在上升过程中,重力势能转化为动能\n", + "C. 球上升到最高点时,若所受力全部消失,球将做减速直线运动\n", + "D. 球下落过程中速度越来越大,因为重力改变了球的运动状态\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 15=======\n", + " 84% 16/19 [00:00<00:00, 16.29it/s]\n", + "=======begin 16=======\n", + "question: 下列关于力和运动的说法,正确的是____\n", + "A. 物体运动状态发生改变,一定受到力的作用\n", + "B. 行驶的汽车急刹车时,乘客会出现向后倾的现象\n", + "C. 用力推桌子,桌子静止不动,因为推力小于摩擦阻力\n", + "D. 踢出去的足球能在空中飞行,是因为足球没有受到力的作用\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 声音可以表达情感,传递信息,对于声现象的理解正确的是____\n", + "A. 教师讲课的声音是由声带振动产生的\n", + "B. “静止鸣笛”是在传播过程中减弱噪音\n", + "C. 声音的振幅越大,音调越高\n", + "D. 只要物体在振动,我们就能听到声音\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 17=======\n", + " 95% 18/19 [00:01<00:00, 16.36it/s]\n", + "=======begin 18=======\n", + "question: 为加强校园安全管理,在校内安装监控摄像机,来自物体的光经过摄像机的镜头后形成____\n", + "A. 倒立、缩小的实像\n", + "B. 正立、放大的实像\n", + "C. 倒立、放大的虚像\n", + "D. 正立、缩小的虚像\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 18=======\n", + "100% 19/19 [00:01<00:00, 16.27it/s]\n", + "Subject: middle_school_physics\n", + "Acc: 36.8421052631579\n", + "Accuracy:\n", + "law : 25.0\n", + "environmental_impact_assessment_engineer : 45.16129032258065\n", + "middle_school_biology : 47.61904761904762\n", + "college_chemistry : 29.166666666666668\n", + "college_economics : 34.54545454545455\n", + "middle_school_mathematics : 26.31578947368421\n", + "computer_architecture : 28.571428571428573\n", + "high_school_mathematics : 16.666666666666668\n", + "college_programming : 37.83783783783784\n", + "computer_network : 31.57894736842105\n", + "basic_medicine : 47.36842105263158\n", + "urban_and_rural_planner : 36.95652173913044\n", + "logic : 50.0\n", + "civil_servant : 36.170212765957444\n", + "art_studies : 39.39393939393939\n", + "advanced_mathematics : 26.31578947368421\n", + "electrical_engineer : 32.432432432432435\n", + "accountant : 32.6530612244898\n", + "operating_system : 42.10526315789474\n", + "middle_school_politics : 57.142857142857146\n", + "sports_science : 36.8421052631579\n", + "middle_school_chemistry : 30.0\n", + "marxism : 52.63157894736842\n", + "fire_engineer : 25.806451612903224\n", + "middle_school_geography : 8.333333333333334\n", + "high_school_history : 50.0\n", + "professional_tour_guide : 37.93103448275862\n", + "middle_school_history : 31.818181818181817\n", + "modern_chinese_history : 39.130434782608695\n", + "clinical_medicine : 36.36363636363637\n", + "high_school_biology : 52.63157894736842\n", + "high_school_politics : 21.05263157894737\n", + "tax_accountant : 34.69387755102041\n", + "teacher_qualification : 56.81818181818182\n", + "high_school_geography : 26.31578947368421\n", + "high_school_chemistry : 31.57894736842105\n", + "plant_protection : 54.54545454545455\n", + "legal_professional : 39.130434782608695\n", + "high_school_physics : 26.31578947368421\n", + "ideological_and_moral_cultivation : 42.10526315789474\n", + "veterinary_medicine : 39.130434782608695\n", + "physician : 30.612244897959183\n", + "college_physics : 21.05263157894737\n", + "discrete_mathematics : 43.75\n", + "mao_zedong_thought : 58.333333333333336\n", + "education_science : 34.48275862068966\n", + "business_administration : 33.333333333333336\n", + "chinese_language_and_literature : 43.47826086956522\n", + "metrology_engineer : 37.5\n", + "probability_and_statistics : 22.22222222222222\n", + "high_school_chinese : 31.57894736842105\n", + "middle_school_physics : 36.8421052631579\n" + ] + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "### 第三步:查看预测结果\n", + "\n", + "运行以下命令查看最终结果,json最后的ALL里会显示,这一次运行结果是:\n", + "```\n", + " \"All\": {\n", + " \"score\": 0.36701337295690933,\n", + " \"num\": 1346,\n", + " \"correct\": 494.0\n", + " }\n", + "```\n", + "\n", + "上述结果与我们论文中汇报的zero-shot 36.7(%)一致。需要注意的是解码存在随机性,如果希望多次运行可将`n_times`改为需要运行的次数。\n" + ], + "metadata": { + "id": "6ZW0bynVoP5K" + } + }, + { + "cell_type": "code", + "source": [ + "!cat ./ceval-output/take0/summary.json" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "W84IQ1RGraet", + "outputId": "79748646-3092-40ca-f980-2ea88078e420" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "{\n", + " \"law\": {\n", + " \"score\": 25.0,\n", + " \"num\": 24,\n", + " \"correct\": 6.0\n", + " },\n", + " \"environmental_impact_assessment_engineer\": {\n", + " \"score\": 45.16129032258065,\n", + " \"num\": 31,\n", + " \"correct\": 14.0\n", + " },\n", + " \"middle_school_biology\": {\n", + " \"score\": 47.61904761904762,\n", + " \"num\": 21,\n", + " \"correct\": 10.0\n", + " },\n", + " \"college_chemistry\": {\n", + " \"score\": 29.166666666666668,\n", + " \"num\": 24,\n", + " \"correct\": 7.0\n", + " },\n", + " \"college_economics\": {\n", + " \"score\": 34.54545454545455,\n", + " \"num\": 55,\n", + " \"correct\": 19.0\n", + " },\n", + " \"middle_school_mathematics\": {\n", + " \"score\": 26.31578947368421,\n", + " \"num\": 19,\n", + " \"correct\": 4.999999999999999\n", + " },\n", + " \"computer_architecture\": {\n", + " \"score\": 28.571428571428573,\n", + " \"num\": 21,\n", + " \"correct\": 6.0\n", + " },\n", + " \"high_school_mathematics\": {\n", + " \"score\": 16.666666666666668,\n", + " \"num\": 18,\n", + " \"correct\": 3.0\n", + " },\n", + " \"college_programming\": {\n", + " \"score\": 37.83783783783784,\n", + " \"num\": 37,\n", + " \"correct\": 14.0\n", + " },\n", + " \"computer_network\": {\n", + " \"score\": 31.57894736842105,\n", + " \"num\": 19,\n", + " \"correct\": 6.0\n", + " },\n", + " \"basic_medicine\": {\n", + " \"score\": 47.36842105263158,\n", + " \"num\": 19,\n", + " \"correct\": 9.000000000000002\n", + " },\n", + " \"urban_and_rural_planner\": {\n", + " \"score\": 36.95652173913044,\n", + " \"num\": 46,\n", + " \"correct\": 17.0\n", + " },\n", + " \"logic\": {\n", + " \"score\": 50.0,\n", + " \"num\": 22,\n", + " \"correct\": 11.0\n", + " },\n", + " \"civil_servant\": {\n", + " \"score\": 36.170212765957444,\n", + " \"num\": 47,\n", + " \"correct\": 17.0\n", + " },\n", + " \"art_studies\": {\n", + " \"score\": 39.39393939393939,\n", + " \"num\": 33,\n", + " \"correct\": 13.0\n", + " },\n", + " \"advanced_mathematics\": {\n", + " \"score\": 26.31578947368421,\n", + " \"num\": 19,\n", + " \"correct\": 4.999999999999999\n", + " },\n", + " \"electrical_engineer\": {\n", + " \"score\": 32.432432432432435,\n", + " \"num\": 37,\n", + " \"correct\": 12.0\n", + " },\n", + " \"accountant\": {\n", + " \"score\": 32.6530612244898,\n", + " \"num\": 49,\n", + " \"correct\": 16.0\n", + " },\n", + " \"operating_system\": {\n", + " \"score\": 42.10526315789474,\n", + " \"num\": 19,\n", + " \"correct\": 8.0\n", + " },\n", + " \"middle_school_politics\": {\n", + " \"score\": 57.142857142857146,\n", + " \"num\": 21,\n", + " \"correct\": 12.0\n", + " },\n", + " \"sports_science\": {\n", + " \"score\": 36.8421052631579,\n", + " \"num\": 19,\n", + " \"correct\": 7.0\n", + " },\n", + " \"middle_school_chemistry\": {\n", + " \"score\": 30.0,\n", + " \"num\": 20,\n", + " \"correct\": 6.0\n", + " },\n", + " \"marxism\": {\n", + " \"score\": 52.63157894736842,\n", + " \"num\": 19,\n", + " \"correct\": 9.999999999999998\n", + " },\n", + " \"fire_engineer\": {\n", + " \"score\": 25.806451612903224,\n", + " \"num\": 31,\n", + " \"correct\": 8.0\n", + " },\n", + " \"middle_school_geography\": {\n", + " \"score\": 8.333333333333334,\n", + " \"num\": 12,\n", + " \"correct\": 1.0\n", + " },\n", + " \"high_school_history\": {\n", + " \"score\": 50.0,\n", + " \"num\": 20,\n", + " \"correct\": 10.0\n", + " },\n", + " \"professional_tour_guide\": {\n", + " \"score\": 37.93103448275862,\n", + " \"num\": 29,\n", + " \"correct\": 11.0\n", + " },\n", + " \"middle_school_history\": {\n", + " \"score\": 31.818181818181817,\n", + " \"num\": 22,\n", + " \"correct\": 7.0\n", + " },\n", + " \"modern_chinese_history\": {\n", + " \"score\": 39.130434782608695,\n", + " \"num\": 23,\n", + " \"correct\": 9.0\n", + " },\n", + " \"clinical_medicine\": {\n", + " \"score\": 36.36363636363637,\n", + " \"num\": 22,\n", + " \"correct\": 8.000000000000002\n", + " },\n", + " \"high_school_biology\": {\n", + " \"score\": 52.63157894736842,\n", + " \"num\": 19,\n", + " \"correct\": 9.999999999999998\n", + " },\n", + " \"high_school_politics\": {\n", + " \"score\": 21.05263157894737,\n", + " \"num\": 19,\n", + " \"correct\": 4.0\n", + " },\n", + " \"tax_accountant\": {\n", + " \"score\": 34.69387755102041,\n", + " \"num\": 49,\n", + " \"correct\": 17.0\n", + " },\n", + " \"teacher_qualification\": {\n", + " \"score\": 56.81818181818182,\n", + " \"num\": 44,\n", + " \"correct\": 25.0\n", + " },\n", + " \"high_school_geography\": {\n", + " \"score\": 26.31578947368421,\n", + " \"num\": 19,\n", + " \"correct\": 4.999999999999999\n", + " },\n", + " \"high_school_chemistry\": {\n", + " \"score\": 31.57894736842105,\n", + " \"num\": 19,\n", + " \"correct\": 6.0\n", + " },\n", + " \"plant_protection\": {\n", + " \"score\": 54.54545454545455,\n", + " \"num\": 22,\n", + " \"correct\": 12.0\n", + " },\n", + " \"legal_professional\": {\n", + " \"score\": 39.130434782608695,\n", + " \"num\": 23,\n", + " \"correct\": 9.0\n", + " },\n", + " \"high_school_physics\": {\n", + " \"score\": 26.31578947368421,\n", + " \"num\": 19,\n", + " \"correct\": 4.999999999999999\n", + " },\n", + " \"ideological_and_moral_cultivation\": {\n", + " \"score\": 42.10526315789474,\n", + " \"num\": 19,\n", + " \"correct\": 8.0\n", + " },\n", + " \"veterinary_medicine\": {\n", + " \"score\": 39.130434782608695,\n", + " \"num\": 23,\n", + " \"correct\": 9.0\n", + " },\n", + " \"physician\": {\n", + " \"score\": 30.612244897959183,\n", + " \"num\": 49,\n", + " \"correct\": 15.0\n", + " },\n", + " \"college_physics\": {\n", + " \"score\": 21.05263157894737,\n", + " \"num\": 19,\n", + " \"correct\": 4.0\n", + " },\n", + " \"discrete_mathematics\": {\n", + " \"score\": 43.75,\n", + " \"num\": 16,\n", + " \"correct\": 7.0\n", + " },\n", + " \"mao_zedong_thought\": {\n", + " \"score\": 58.333333333333336,\n", + " \"num\": 24,\n", + " \"correct\": 14.0\n", + " },\n", + " \"education_science\": {\n", + " \"score\": 34.48275862068966,\n", + " \"num\": 29,\n", + " \"correct\": 10.000000000000002\n", + " },\n", + " \"business_administration\": {\n", + " \"score\": 33.333333333333336,\n", + " \"num\": 33,\n", + " \"correct\": 11.0\n", + " },\n", + " \"chinese_language_and_literature\": {\n", + " \"score\": 43.47826086956522,\n", + " \"num\": 23,\n", + " \"correct\": 10.0\n", + " },\n", + " \"metrology_engineer\": {\n", + " \"score\": 37.5,\n", + " \"num\": 24,\n", + " \"correct\": 9.0\n", + " },\n", + " \"probability_and_statistics\": {\n", + " \"score\": 22.22222222222222,\n", + " \"num\": 18,\n", + " \"correct\": 4.0\n", + " },\n", + " \"high_school_chinese\": {\n", + " \"score\": 31.57894736842105,\n", + " \"num\": 19,\n", + " \"correct\": 6.0\n", + " },\n", + " \"middle_school_physics\": {\n", + " \"score\": 36.8421052631579,\n", + " \"num\": 19,\n", + " \"correct\": 7.0\n", + " },\n", + " \"grouped\": {\n", + " \"STEM\": {\n", + " \"correct\": 143.0,\n", + " \"num\": 430,\n", + " \"score\": 0.3325581395348837\n", + " },\n", + " \"Social Science\": {\n", + " \"correct\": 111.0,\n", + " \"num\": 275,\n", + " \"score\": 0.4036363636363636\n", + " },\n", + " \"Humanities\": {\n", + " \"correct\": 100.0,\n", + " \"num\": 257,\n", + " \"score\": 0.38910505836575876\n", + " },\n", + " \"Other\": {\n", + " \"correct\": 140.0,\n", + " \"num\": 384,\n", + " \"score\": 0.3645833333333333\n", + " }\n", + " },\n", + " \"All\": {\n", + " \"score\": 0.36701337295690933,\n", + " \"num\": 1346,\n", + " \"correct\": 494.0\n", + " }\n", + "}" + ] + } + ] + } + ] +} \ No newline at end of file diff --git a/scripts/README.md b/scripts/README.md index 9273e38..241255b 100644 --- a/scripts/README.md +++ b/scripts/README.md @@ -36,6 +36,12 @@ Code for extending Chinese vocabulary, Wiki: https://github.com/ymcui/Chinese-LL Script for merging LLaMA/Alpaca LoRA. Wiki: https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/Manual-Conversion +### merge_llama_with_chinese_lora_low_mem.py + +(推荐)低资源版合并LLaMA/Alpaca LoRA脚本,Wiki: [https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/手动模型合并与转换](https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/手动模型合并与转换) + +(recommended)Script for merging LLaMA/Alpaca LoRA (low-resource version). Wiki: https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/Manual-Conversion + ### crawl_prompt.py 指令数据爬取脚本,Wiki:[https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/训练细节#训练数据](https://github.com/ymcui/Chinese-LLaMA-Alpaca/wiki/训练细节#训练数据)