diff --git a/README.md b/README.md index af63da7..e5db62b 100644 --- a/README.md +++ b/README.md @@ -37,7 +37,9 @@ ## 新闻 -**[2023/06/08] [v4.0版本](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v4.0): 发布中文LLaMA/Alpaca-33B、添加privateGPT使用示例、添加C-Eval结果等。** +**[2023/06/16] [v4.1版本](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v4.1): 发布新版技术报告、添加C-Eval解码脚本、添加低资源模型合并脚本等。** + +[2023/06/08] [v4.0版本](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v4.0): 发布中文LLaMA/Alpaca-33B、添加privateGPT使用示例、添加C-Eval结果等。 [2023/06/05] llama.cpp已支持Apple Silicon GPU解码,解码速度大幅提升,详见:[讨论区#开发者公告](https://github.com/ymcui/Chinese-LLaMA-Alpaca/discussions/505) @@ -229,7 +231,7 @@ chinese_llama_lora_7b/ ### 客观效果评测 -本项目还在“NLU”类客观评测集合上对相关模型进行了测试。这类评测的结果不具有主观性,只需要输出给定标签(需要设计标签mapping策略),因此可以从另外一个侧面了解大模型的能力。本项目在近期推出的[C-Eval评测数据集](https://cevalbenchmark.com)上测试了相关模型效果,其中测试集包含12.3K个选择题,涵盖52个学科。以下是部分模型的valid和test集评测结果(Average),完整结果后续将更新至[技术报告](https://arxiv.org/abs/2304.08177)。 +本项目还在“NLU”类客观评测集合上对相关模型进行了测试。这类评测的结果不具有主观性,只需要输出给定标签(需要设计标签mapping策略),因此可以从另外一个侧面了解大模型的能力。本项目在近期推出的[C-Eval评测数据集](https://cevalbenchmark.com)上测试了相关模型效果,其中测试集包含12.3K个选择题,涵盖52个学科。以下是部分模型的valid和test集评测结果(Average),完整结果请参考[技术报告](https://arxiv.org/abs/2304.08177)。 | 模型 | Valid (zero-shot) | Valid (5-shot) | Test (zero-shot) | Test (5-shot) | | ----------------------- | :---------------: | :------------: | :--------------: | :-----------: | diff --git a/README_EN.md b/README_EN.md index c70046e..6123407 100644 --- a/README_EN.md +++ b/README_EN.md @@ -39,7 +39,9 @@ To promote open research of large models in the Chinese NLP community, this proj ## News -**[June 8, 2023] [Release v4.0](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v4.0): LLaMA/Alpaca 33B versions are available. We also add privateGPT demo, C-Eval results, etc.** +**[June 16, 2023] [Release v4.1](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v4.1): New technical report, add C-Eval inference script, add low-resource model merging script, etc.** + +[June 8, 2023] [Release v4.0](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v4.0): LLaMA/Alpaca 33B versions are available. We also add privateGPT demo, C-Eval results, etc. [May 16, 2023] [Release v3.2](https://github.com/ymcui/Chinese-LLaMA-Alpaca/releases/tag/v3.2): Add SFT scripts, LangChain supports, Gradio-based web demo, etc. @@ -233,7 +235,7 @@ In order to quickly evaluate the actual performance of related models, this proj ### NLU Performance Test -This project also conducted tests on relevant models using the "NLU" objective evaluation dataset. The results of this type of evaluation are objective and only require the output of given labels, so they can provide insights into the capabilities of large models from another perspective. In the recently launched [C-Eval dataset](https://cevalbenchmark.com/), this project tested the performance of the relevant models. The test set contains 12.3K multiple-choice questions covering 52 subjects. The following are the evaluation results (average) of some models on the validation and test sets, and the complete results will be updated in the [technical report](https://arxiv.org/abs/2304.08177) later. +This project also conducted tests on relevant models using the "NLU" objective evaluation dataset. The results of this type of evaluation are objective and only require the output of given labels, so they can provide insights into the capabilities of large models from another perspective. In the recently launched [C-Eval dataset](https://cevalbenchmark.com/), this project tested the performance of the relevant models. The test set contains 12.3K multiple-choice questions covering 52 subjects. The following are the evaluation results (average) of some models on the validation and test sets. For complete results, please refer to our [technical report](https://arxiv.org/abs/2304.08177). | Models | Valid (zero-shot) | Valid (5-shot) | Test (zero-shot) | Test (5-shot) | | ----------------------- | :---------------: | :------------: | :--------------: | :-----------: | diff --git a/notebooks/README.md b/notebooks/README.md index 27b4486..6183ae6 100644 --- a/notebooks/README.md +++ b/notebooks/README.md @@ -1,5 +1,13 @@ # 笔记本示例 Notebooks +### ceval_example_for_chinese_alpaca.ipynb + +利用Chinese Alpaca模型解码C-Eval数据集的示例。 + +Example of decoding C-Eval dataset with Chinese Alpaca. + +建议查看Colab上的最新版 / Check latest notebook:Open In Colab + ### convert_and_quantize_chinese_llama_and_alpaca.ipynb Colab上的转换和量化中文LLaMA/Alpaca(含Plus版本)的运行示例(仅供流程参考)。 diff --git a/notebooks/ceval_example_for_chinese_alpaca.ipynb b/notebooks/ceval_example_for_chinese_alpaca.ipynb new file mode 100644 index 0000000..e7f7b0b --- /dev/null +++ b/notebooks/ceval_example_for_chinese_alpaca.ipynb @@ -0,0 +1,9216 @@ +{ + "nbformat": 4, + "nbformat_minor": 0, + "metadata": { + "colab": { + "provenance": [], + "machine_shape": "hm", + "gpuType": "A100" + }, + "kernelspec": { + "name": "python3", + "display_name": "Python 3" + }, + "language_info": { + "name": "python" + }, + "accelerator": "GPU" + }, + "cells": [ + { + "cell_type": "markdown", + "source": [ + "## Chinese-Alpaca-Plus-7B在C-Eval数据集上的解码示例\n", + "\n", + "教程分为三部分:\n", + "1. 合并模型\n", + "2. 利用合并好的模型在C-Eval valid set上进行解码\n", + "3. 利用官方脚本测试效果\n", + "\n", + "机器选择:此处使用A100进行预测(实际7B模型用V100应该也是可以的)。" + ], + "metadata": { + "id": "YZK18qU0i1OX" + } + }, + { + "cell_type": "code", + "source": [ + "!nvidia-smi" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "-NsCebA-jhR1", + "outputId": "e5893364-c253-4547-dd34-9a4f36debd92" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "Fri Jun 16 00:26:45 2023 \n", + "+-----------------------------------------------------------------------------+\n", + "| NVIDIA-SMI 525.85.12 Driver Version: 525.85.12 CUDA Version: 12.0 |\n", + "|-------------------------------+----------------------+----------------------+\n", + "| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |\n", + "| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |\n", + "| | | MIG M. |\n", + "|===============================+======================+======================|\n", + "| 0 NVIDIA A100-SXM... Off | 00000000:00:04.0 Off | 0 |\n", + "| N/A 30C P0 43W / 400W | 0MiB / 40960MiB | 0% Default |\n", + "| | | Disabled |\n", + "+-------------------------------+----------------------+----------------------+\n", + " \n", + "+-----------------------------------------------------------------------------+\n", + "| Processes: |\n", + "| GPU GI CI PID Type Process name GPU Memory |\n", + "| ID ID Usage |\n", + "|=============================================================================|\n", + "| No running processes found |\n", + "+-----------------------------------------------------------------------------+\n" + ] + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "### 第一步:合并模型\n", + "\n", + "这里不再一一介绍流程,感兴趣的可以去看另一个Colab Notebook:https://colab.research.google.com/drive/1FnFkyKhrnS7s-2lDDeous-AutdI_SkAd?usp=sharing" + ], + "metadata": { + "id": "I3ktMOOwjIHt" + } + }, + { + "cell_type": "code", + "source": [ + "!pip install torch==1.13.1\n", + "!pip install transformers==4.30.2\n", + "!pip install peft==0.3.0\n", + "!pip install sentencepiece\n", + "!git clone https://github.com/ymcui/Chinese-LLaMA-Alpaca" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "NKEfHiOVi03t", + "outputId": "5db25c82-ca49-46b8-f3ac-f1cde210218b" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/\n", + "Collecting torch==1.13.1\n", + " Downloading torch-1.13.1-cp310-cp310-manylinux1_x86_64.whl (887.5 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m887.5/887.5 MB\u001b[0m \u001b[31m1.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: typing-extensions in /usr/local/lib/python3.10/dist-packages (from torch==1.13.1) (4.5.0)\n", + "Collecting nvidia-cuda-runtime-cu11==11.7.99 (from torch==1.13.1)\n", + " Downloading nvidia_cuda_runtime_cu11-11.7.99-py3-none-manylinux1_x86_64.whl (849 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m849.3/849.3 kB\u001b[0m \u001b[31m67.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting nvidia-cudnn-cu11==8.5.0.96 (from torch==1.13.1)\n", + " Downloading nvidia_cudnn_cu11-8.5.0.96-2-py3-none-manylinux1_x86_64.whl (557.1 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m557.1/557.1 MB\u001b[0m \u001b[31m1.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting nvidia-cublas-cu11==11.10.3.66 (from torch==1.13.1)\n", + " Downloading nvidia_cublas_cu11-11.10.3.66-py3-none-manylinux1_x86_64.whl (317.1 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m317.1/317.1 MB\u001b[0m \u001b[31m3.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting nvidia-cuda-nvrtc-cu11==11.7.99 (from torch==1.13.1)\n", + " Downloading nvidia_cuda_nvrtc_cu11-11.7.99-2-py3-none-manylinux1_x86_64.whl (21.0 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m21.0/21.0 MB\u001b[0m \u001b[31m64.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: setuptools in /usr/local/lib/python3.10/dist-packages (from nvidia-cublas-cu11==11.10.3.66->torch==1.13.1) (67.7.2)\n", + "Requirement already satisfied: wheel in /usr/local/lib/python3.10/dist-packages (from nvidia-cublas-cu11==11.10.3.66->torch==1.13.1) (0.40.0)\n", + "Installing collected packages: nvidia-cuda-runtime-cu11, nvidia-cuda-nvrtc-cu11, nvidia-cublas-cu11, nvidia-cudnn-cu11, torch\n", + " Attempting uninstall: torch\n", + " Found existing installation: torch 2.0.1+cu118\n", + " Uninstalling torch-2.0.1+cu118:\n", + " Successfully uninstalled torch-2.0.1+cu118\n", + "\u001b[31mERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.\n", + "torchaudio 2.0.2+cu118 requires torch==2.0.1, but you have torch 1.13.1 which is incompatible.\n", + "torchdata 0.6.1 requires torch==2.0.1, but you have torch 1.13.1 which is incompatible.\n", + "torchtext 0.15.2 requires torch==2.0.1, but you have torch 1.13.1 which is incompatible.\n", + "torchvision 0.15.2+cu118 requires torch==2.0.1, but you have torch 1.13.1 which is incompatible.\u001b[0m\u001b[31m\n", + "\u001b[0mSuccessfully installed nvidia-cublas-cu11-11.10.3.66 nvidia-cuda-nvrtc-cu11-11.7.99 nvidia-cuda-runtime-cu11-11.7.99 nvidia-cudnn-cu11-8.5.0.96 torch-1.13.1\n", + "Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/\n", + "Collecting transformers==4.30.2\n", + " Downloading transformers-4.30.2-py3-none-any.whl (7.2 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m7.2/7.2 MB\u001b[0m \u001b[31m79.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: filelock in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (3.12.0)\n", + "Collecting huggingface-hub<1.0,>=0.14.1 (from transformers==4.30.2)\n", + " Downloading huggingface_hub-0.15.1-py3-none-any.whl (236 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m236.8/236.8 kB\u001b[0m \u001b[31m27.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: numpy>=1.17 in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (1.22.4)\n", + "Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (23.1)\n", + "Requirement already satisfied: pyyaml>=5.1 in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (6.0)\n", + "Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (2022.10.31)\n", + "Requirement already satisfied: requests in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (2.27.1)\n", + "Collecting tokenizers!=0.11.3,<0.14,>=0.11.1 (from transformers==4.30.2)\n", + " Downloading tokenizers-0.13.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (7.8 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m7.8/7.8 MB\u001b[0m \u001b[31m115.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting safetensors>=0.3.1 (from transformers==4.30.2)\n", + " Downloading safetensors-0.3.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.3 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.3/1.3 MB\u001b[0m \u001b[31m83.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: tqdm>=4.27 in /usr/local/lib/python3.10/dist-packages (from transformers==4.30.2) (4.65.0)\n", + "Requirement already satisfied: fsspec in /usr/local/lib/python3.10/dist-packages (from huggingface-hub<1.0,>=0.14.1->transformers==4.30.2) (2023.4.0)\n", + "Requirement already satisfied: typing-extensions>=3.7.4.3 in /usr/local/lib/python3.10/dist-packages (from huggingface-hub<1.0,>=0.14.1->transformers==4.30.2) (4.5.0)\n", + "Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests->transformers==4.30.2) (1.26.15)\n", + "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests->transformers==4.30.2) (2022.12.7)\n", + "Requirement already satisfied: charset-normalizer~=2.0.0 in /usr/local/lib/python3.10/dist-packages (from requests->transformers==4.30.2) (2.0.12)\n", + "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests->transformers==4.30.2) (3.4)\n", + "Installing collected packages: tokenizers, safetensors, huggingface-hub, transformers\n", + "Successfully installed huggingface-hub-0.15.1 safetensors-0.3.1 tokenizers-0.13.3 transformers-4.30.2\n", + "Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/\n", + "Collecting peft==0.3.0\n", + " Downloading peft-0.3.0-py3-none-any.whl (56 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m56.8/56.8 kB\u001b[0m \u001b[31m3.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: numpy>=1.17 in /usr/local/lib/python3.10/dist-packages (from peft==0.3.0) (1.22.4)\n", + "Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from peft==0.3.0) (23.1)\n", + "Requirement already satisfied: psutil in /usr/local/lib/python3.10/dist-packages (from peft==0.3.0) (5.9.5)\n", + "Requirement already satisfied: pyyaml in /usr/local/lib/python3.10/dist-packages (from peft==0.3.0) (6.0)\n", + "Requirement already satisfied: torch>=1.13.0 in /usr/local/lib/python3.10/dist-packages (from peft==0.3.0) (1.13.1)\n", + "Requirement already satisfied: transformers in /usr/local/lib/python3.10/dist-packages (from peft==0.3.0) (4.30.2)\n", + "Collecting accelerate (from peft==0.3.0)\n", + " Downloading accelerate-0.20.3-py3-none-any.whl (227 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m227.6/227.6 kB\u001b[0m \u001b[31m18.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: typing-extensions in /usr/local/lib/python3.10/dist-packages (from torch>=1.13.0->peft==0.3.0) (4.5.0)\n", + "Requirement already satisfied: nvidia-cuda-runtime-cu11==11.7.99 in /usr/local/lib/python3.10/dist-packages (from torch>=1.13.0->peft==0.3.0) (11.7.99)\n", + "Requirement already satisfied: nvidia-cudnn-cu11==8.5.0.96 in /usr/local/lib/python3.10/dist-packages (from torch>=1.13.0->peft==0.3.0) (8.5.0.96)\n", + "Requirement already satisfied: nvidia-cublas-cu11==11.10.3.66 in /usr/local/lib/python3.10/dist-packages (from torch>=1.13.0->peft==0.3.0) (11.10.3.66)\n", + "Requirement already satisfied: nvidia-cuda-nvrtc-cu11==11.7.99 in /usr/local/lib/python3.10/dist-packages (from torch>=1.13.0->peft==0.3.0) (11.7.99)\n", + "Requirement already satisfied: setuptools in /usr/local/lib/python3.10/dist-packages (from nvidia-cublas-cu11==11.10.3.66->torch>=1.13.0->peft==0.3.0) (67.7.2)\n", + "Requirement already satisfied: wheel in /usr/local/lib/python3.10/dist-packages (from nvidia-cublas-cu11==11.10.3.66->torch>=1.13.0->peft==0.3.0) (0.40.0)\n", + "Requirement already satisfied: filelock in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (3.12.0)\n", + "Requirement already satisfied: huggingface-hub<1.0,>=0.14.1 in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (0.15.1)\n", + "Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (2022.10.31)\n", + "Requirement already satisfied: requests in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (2.27.1)\n", + "Requirement already satisfied: tokenizers!=0.11.3,<0.14,>=0.11.1 in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (0.13.3)\n", + "Requirement already satisfied: safetensors>=0.3.1 in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (0.3.1)\n", + "Requirement already satisfied: tqdm>=4.27 in /usr/local/lib/python3.10/dist-packages (from transformers->peft==0.3.0) (4.65.0)\n", + "Requirement already satisfied: fsspec in /usr/local/lib/python3.10/dist-packages (from huggingface-hub<1.0,>=0.14.1->transformers->peft==0.3.0) (2023.4.0)\n", + "Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests->transformers->peft==0.3.0) (1.26.15)\n", + "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests->transformers->peft==0.3.0) (2022.12.7)\n", + "Requirement already satisfied: charset-normalizer~=2.0.0 in /usr/local/lib/python3.10/dist-packages (from requests->transformers->peft==0.3.0) (2.0.12)\n", + "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests->transformers->peft==0.3.0) (3.4)\n", + "Installing collected packages: accelerate, peft\n", + "Successfully installed accelerate-0.20.3 peft-0.3.0\n", + "Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/\n", + "Collecting sentencepiece\n", + " Downloading sentencepiece-0.1.99-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.3 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.3/1.3 MB\u001b[0m \u001b[31m29.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hInstalling collected packages: sentencepiece\n", + "Successfully installed sentencepiece-0.1.99\n", + "Cloning into 'Chinese-LLaMA-Alpaca'...\n", + "remote: Enumerating objects: 1429, done.\u001b[K\n", + "remote: Counting objects: 100% (621/621), done.\u001b[K\n", + "remote: Compressing objects: 100% (272/272), done.\u001b[K\n", + "remote: Total 1429 (delta 381), reused 512 (delta 345), pack-reused 808\u001b[K\n", + "Receiving objects: 100% (1429/1429), 22.65 MiB | 21.96 MiB/s, done.\n", + "Resolving deltas: 100% (843/843), done.\n" + ] + } + ] + }, + { + "cell_type": "code", + "source": [ + "!python ./Chinese-LLaMA-Alpaca/scripts/merge_llama_with_chinese_lora_low_mem.py \\\n", + " --base_model elinas/llama-7b-hf-transformers-4.29 \\\n", + " --lora_model ziqingyang/chinese-llama-plus-lora-7b,ziqingyang/chinese-alpaca-plus-lora-7b \\\n", + " --output_type huggingface \\\n", + " --output_dir alpaca-combined-hf" + ], + "metadata": { + "id": "fcxYLgsujTf3", + "colab": { + "base_uri": "https://localhost:8080/" + }, + "outputId": "1b84f00c-825b-4c03-8d3a-90b04f2e9d43" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "Base model: elinas/llama-7b-hf-transformers-4.29\n", + "LoRA model(s) ['ziqingyang/chinese-llama-plus-lora-7b', 'ziqingyang/chinese-alpaca-plus-lora-7b']:\n", + "Loading ziqingyang/chinese-llama-plus-lora-7b\n", + "Cannot find lora model on the disk. Downloading lora model from hub...\n", + "Fetching 7 files: 0% 0/7 [00:00] 1.48M --.-KB/s in 0.02s \n", + "\n", + "2023-06-16 00:46:12 (61.7 MB/s) - ‘ceval-exam.zip.1’ saved [1548186/1548186]\n", + "\n", + "Archive: ceval-exam.zip\n", + " creating: data/dev/\n", + " inflating: data/dev/accountant_dev.csv \n", + " inflating: data/dev/advanced_mathematics_dev.csv \n", + " inflating: data/dev/art_studies_dev.csv \n", + " inflating: data/dev/basic_medicine_dev.csv \n", + " inflating: data/dev/business_administration_dev.csv \n", + " inflating: data/dev/chinese_language_and_literature_dev.csv \n", + " inflating: data/dev/civil_servant_dev.csv \n", + " inflating: data/dev/clinical_medicine_dev.csv \n", + " inflating: data/dev/college_chemistry_dev.csv \n", + " inflating: data/dev/college_economics_dev.csv \n", + " inflating: data/dev/college_physics_dev.csv \n", + " inflating: data/dev/college_programming_dev.csv \n", + " inflating: data/dev/computer_architecture_dev.csv \n", + " inflating: data/dev/computer_network_dev.csv \n", + " inflating: data/dev/discrete_mathematics_dev.csv \n", + " inflating: data/dev/education_science_dev.csv \n", + " inflating: data/dev/electrical_engineer_dev.csv \n", + " inflating: data/dev/environmental_impact_assessment_engineer_dev.csv \n", + " inflating: data/dev/fire_engineer_dev.csv \n", + " inflating: data/dev/high_school_biology_dev.csv \n", + " inflating: data/dev/high_school_chemistry_dev.csv \n", + " inflating: data/dev/high_school_chinese_dev.csv \n", + " inflating: data/dev/high_school_geography_dev.csv \n", + " inflating: data/dev/high_school_history_dev.csv \n", + " inflating: data/dev/high_school_mathematics_dev.csv \n", + " inflating: data/dev/high_school_physics_dev.csv \n", + " inflating: data/dev/high_school_politics_dev.csv \n", + " inflating: data/dev/ideological_and_moral_cultivation_dev.csv \n", + " inflating: data/dev/law_dev.csv \n", + " inflating: data/dev/legal_professional_dev.csv \n", + " inflating: data/dev/logic_dev.csv \n", + " inflating: data/dev/mao_zedong_thought_dev.csv \n", + " inflating: data/dev/marxism_dev.csv \n", + " inflating: data/dev/metrology_engineer_dev.csv \n", + " inflating: data/dev/middle_school_biology_dev.csv \n", + " inflating: data/dev/middle_school_chemistry_dev.csv \n", + " inflating: data/dev/middle_school_geography_dev.csv \n", + " inflating: data/dev/middle_school_history_dev.csv \n", + " inflating: data/dev/middle_school_mathematics_dev.csv \n", + " inflating: data/dev/middle_school_physics_dev.csv \n", + " inflating: data/dev/middle_school_politics_dev.csv \n", + " inflating: data/dev/modern_chinese_history_dev.csv \n", + " inflating: data/dev/operating_system_dev.csv \n", + " inflating: data/dev/physician_dev.csv \n", + " inflating: data/dev/plant_protection_dev.csv \n", + " inflating: data/dev/probability_and_statistics_dev.csv \n", + " inflating: data/dev/professional_tour_guide_dev.csv \n", + " inflating: data/dev/sports_science_dev.csv \n", + " inflating: data/dev/tax_accountant_dev.csv \n", + " inflating: data/dev/teacher_qualification_dev.csv \n", + " inflating: data/dev/urban_and_rural_planner_dev.csv \n", + " inflating: data/dev/veterinary_medicine_dev.csv \n", + " creating: data/test/\n", + " inflating: data/test/accountant_test.csv \n", + " inflating: data/test/advanced_mathematics_test.csv \n", + " inflating: data/test/art_studies_test.csv \n", + " inflating: data/test/basic_medicine_test.csv \n", + " inflating: data/test/business_administration_test.csv \n", + " inflating: data/test/chinese_language_and_literature_test.csv \n", + " inflating: data/test/civil_servant_test.csv \n", + " inflating: data/test/clinical_medicine_test.csv \n", + " inflating: data/test/college_chemistry_test.csv \n", + " inflating: data/test/college_economics_test.csv \n", + " inflating: data/test/college_physics_test.csv \n", + " inflating: data/test/college_programming_test.csv \n", + " inflating: data/test/computer_architecture_test.csv \n", + " inflating: data/test/computer_network_test.csv \n", + " inflating: data/test/discrete_mathematics_test.csv \n", + " inflating: data/test/education_science_test.csv \n", + " inflating: data/test/electrical_engineer_test.csv \n", + " inflating: data/test/environmental_impact_assessment_engineer_test.csv \n", + " inflating: data/test/fire_engineer_test.csv \n", + " inflating: data/test/high_school_biology_test.csv \n", + " inflating: data/test/high_school_chemistry_test.csv \n", + " inflating: data/test/high_school_chinese_test.csv \n", + " inflating: data/test/high_school_geography_test.csv \n", + " inflating: data/test/high_school_history_test.csv \n", + " inflating: data/test/high_school_mathematics_test.csv \n", + " inflating: data/test/high_school_physics_test.csv \n", + " inflating: data/test/high_school_politics_test.csv \n", + " inflating: data/test/ideological_and_moral_cultivation_test.csv \n", + " inflating: data/test/law_test.csv \n", + " inflating: data/test/legal_professional_test.csv \n", + " inflating: data/test/logic_test.csv \n", + " inflating: data/test/mao_zedong_thought_test.csv \n", + " inflating: data/test/marxism_test.csv \n", + " inflating: data/test/metrology_engineer_test.csv \n", + " inflating: data/test/middle_school_biology_test.csv \n", + " inflating: data/test/middle_school_chemistry_test.csv \n", + " inflating: data/test/middle_school_geography_test.csv \n", + " inflating: data/test/middle_school_history_test.csv \n", + " inflating: data/test/middle_school_mathematics_test.csv \n", + " inflating: data/test/middle_school_physics_test.csv \n", + " inflating: data/test/middle_school_politics_test.csv \n", + " inflating: data/test/modern_chinese_history_test.csv \n", + " inflating: data/test/operating_system_test.csv \n", + " inflating: data/test/physician_test.csv \n", + " inflating: data/test/plant_protection_test.csv \n", + " inflating: data/test/probability_and_statistics_test.csv \n", + " inflating: data/test/professional_tour_guide_test.csv \n", + " inflating: data/test/sports_science_test.csv \n", + " inflating: data/test/tax_accountant_test.csv \n", + " inflating: data/test/teacher_qualification_test.csv \n", + " inflating: data/test/urban_and_rural_planner_test.csv \n", + " inflating: data/test/veterinary_medicine_test.csv \n", + " creating: data/val/\n", + " inflating: data/val/accountant_val.csv \n", + " inflating: data/val/advanced_mathematics_val.csv \n", + " inflating: data/val/art_studies_val.csv \n", + " inflating: data/val/basic_medicine_val.csv \n", + " inflating: data/val/business_administration_val.csv \n", + " inflating: data/val/chinese_language_and_literature_val.csv \n", + " inflating: data/val/civil_servant_val.csv \n", + " inflating: data/val/clinical_medicine_val.csv \n", + " inflating: data/val/college_chemistry_val.csv \n", + " inflating: data/val/college_economics_val.csv \n", + " inflating: data/val/college_physics_val.csv \n", + " inflating: data/val/college_programming_val.csv \n", + " inflating: data/val/computer_architecture_val.csv \n", + " inflating: data/val/computer_network_val.csv \n", + " inflating: data/val/discrete_mathematics_val.csv \n", + " inflating: data/val/education_science_val.csv \n", + " inflating: data/val/electrical_engineer_val.csv \n", + " inflating: data/val/environmental_impact_assessment_engineer_val.csv \n", + " inflating: data/val/fire_engineer_val.csv \n", + " inflating: data/val/high_school_biology_val.csv \n", + " inflating: data/val/high_school_chemistry_val.csv \n", + " inflating: data/val/high_school_chinese_val.csv \n", + " inflating: data/val/high_school_geography_val.csv \n", + " inflating: data/val/high_school_history_val.csv \n", + " inflating: data/val/high_school_mathematics_val.csv \n", + " inflating: data/val/high_school_physics_val.csv \n", + " inflating: data/val/high_school_politics_val.csv \n", + " inflating: data/val/ideological_and_moral_cultivation_val.csv \n", + " inflating: data/val/law_val.csv \n", + " inflating: data/val/legal_professional_val.csv \n", + " inflating: data/val/logic_val.csv \n", + " inflating: data/val/mao_zedong_thought_val.csv \n", + " inflating: data/val/marxism_val.csv \n", + " inflating: data/val/metrology_engineer_val.csv \n", + " inflating: data/val/middle_school_biology_val.csv \n", + " inflating: data/val/middle_school_chemistry_val.csv \n", + " inflating: data/val/middle_school_geography_val.csv \n", + " inflating: data/val/middle_school_history_val.csv \n", + " inflating: data/val/middle_school_mathematics_val.csv \n", + " inflating: data/val/middle_school_physics_val.csv \n", + " inflating: data/val/middle_school_politics_val.csv \n", + " inflating: data/val/modern_chinese_history_val.csv \n", + " inflating: data/val/operating_system_val.csv \n", + " inflating: data/val/physician_val.csv \n", + " inflating: data/val/plant_protection_val.csv \n", + " inflating: data/val/probability_and_statistics_val.csv \n", + " inflating: data/val/professional_tour_guide_val.csv \n", + " inflating: data/val/sports_science_val.csv \n", + " inflating: data/val/tax_accountant_val.csv \n", + " inflating: data/val/teacher_qualification_val.csv \n", + " inflating: data/val/urban_and_rural_planner_val.csv \n", + " inflating: data/val/veterinary_medicine_val.csv \n" + ] + } + ] + }, + { + "cell_type": "code", + "source": [ + "!ls ./Chinese-LLaMA-Alpaca/scripts/ceval/data" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "DOv5egC5qya-", + "outputId": "cff118db-a683-4ffd-9b02-1504bde2781b" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "dev test val\n" + ] + } + ] + }, + { + "cell_type": "code", + "source": [ + "!cd ./Chinese-LLaMA-Alpaca/scripts/ceval && CUDA_VISIBLE_DEVICES=0 python eval.py \\\n", + " --model_path /content/alpaca-combined-hf \\\n", + " --cot False \\\n", + " --few_shot False \\\n", + " --with_prompt True \\\n", + " --constrained_decoding True \\\n", + " --temperature 0.2 \\\n", + " --n_times 1 \\\n", + " --ntrain 5 \\\n", + " --do_save_csv False \\\n", + " --do_test False \\\n", + " --output_dir /content/ceval-output" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "p12pE_BanZbM", + "outputId": "ae44ea93-c8c9-42e0-ba3a-27474e370e36" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "\u001b[1;30;43m流式输出内容被截断,只能显示最后 5000 行内容。\u001b[0m\n", + " 27% 6/22 [00:00<00:00, 16.53it/s]\n", + "=======begin 6=======\n", + "question: 下列四组矿质元素,哪组元素的缺素症表现为叶片缺绿?____\n", + "A. Fe,Cl,N,Ca\n", + "B. Mg,Fe,N,S\n", + "C. P,N,Mg,S\n", + "D. P,Mo,Mg,S\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 6=======\n", + "\n", + "=======begin 7=======\n", + "question: 在衰老的植物组织或器官中,蛋白质含量显著下降,其主要原因是____。\n", + "A. 蛋白质周转速率降低\n", + "B. 氨基酸的生物合成受阻\n", + "C. 蛋白质水解酶活性增加\n", + "D. 土壤中氮素含量下降\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 7=======\n", + " 36% 8/22 [00:00<00:00, 16.48it/s]\n", + "=======begin 8=======\n", + "question: G蛋白是由α、β、γ3种亚基构成的异三聚体,与GTP结合的活性位点在____上。\n", + "A. α亚基\n", + "B. β亚基\n", + "C. γ亚基\n", + "D. βγ亚基复合体\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 8=======\n", + "\n", + "=======begin 9=======\n", + "question: 已分化的细胞在特定情况下恢复分裂能力,重新进行细胞分裂的过程为____。\n", + "A. 细胞分化\n", + "B. 细胞分裂\n", + "C. 再分化\n", + "D. 脱分化\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: D \n", + "\n", + "=======end 9=======\n", + " 45% 10/22 [00:00<00:00, 16.45it/s]\n", + "=======begin 10=======\n", + "question: 植物器官、组织或细胞在形态结构、生化组成以及生理功能上的不对称性是____。\n", + "A. 细胞生长\n", + "B. 细胞分化\n", + "C. 极性\n", + "D. 细胞全能性\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 10=======\n", + "\n", + "=======begin 11=======\n", + "question: 耐热性强的植物原生质蛋白质具有的特点是____。\n", + "A. 易发生不可逆的变性与凝聚\n", + "B. 疏水键多,二硫键少\n", + "C. 对热稳定,疏水键和二硫键多\n", + "D. 对热稳定,疏水键和二硫键少\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: C \n", + "\n", + "=======end 11=======\n", + " 55% 12/22 [00:00<00:00, 16.46it/s]\n", + "=======begin 12=======\n", + "question: 细胞在形态结构、内部代谢和生理功能上区别于原分生细胞的过程是____。\n", + "A. 细胞生长\n", + "B. 细胞分化\n", + "C. 细胞脱分化\n", + "D. 细胞再分化\n", + "答案是什么? \n", + "response: A\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 12=======\n", + "\n", + "=======begin 13=======\n", + "question: 光学显微镜下呈现出的细胞结构被称为____。\n", + "A. 显微结构\n", + "B. 亚显微结构\n", + "C. 超显微结构\n", + "D. 亚细胞结构\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 13=======\n", + " 64% 14/22 [00:00<00:00, 16.36it/s]\n", + "=======begin 14=======\n", + "question: 根尖中根的干细胞群是指____。\n", + "A. 根冠\n", + "B. 静止中心\n", + "C. 分生组织\n", + "D. 根毛\n", + "答案是什么? \n", + "response: C\n", + "ans: C\n", + "ground truth: B \n", + "\n", + "=======end 14=======\n", + "\n", + "=======begin 15=======\n", + "question: 质膜上已知的离子通道有____。\n", + "A. $K^+$,$Mg^{2+}$,$Ca^{2+}$\n", + "B. $K^+$,$Cl^-$,$Ca^{2+}$\n", + "C. $K^+$,$Cl^-$,$Mg^{2+}$\n", + "D. $K^+$,$Zn^{2+}$,$Ca^{2+}$和$Fe^{2+}$\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 15=======\n", + " 73% 16/22 [00:00<00:00, 16.38it/s]\n", + "=======begin 16=======\n", + "question: 把植物组织放在高渗溶液中,植物组织____。\n", + "A. 吸水\n", + "B. 失水\n", + "C. 水分动态平衡\n", + "D. 水分不变化\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 下列哪种元素的缺素症表现为植物叶尖积累过多的脲,严重出现坏死?____\n", + "A. Ni\n", + "B. Mn\n", + "C. Fe\n", + "D. Ca\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: A \n", + "\n", + "=======end 17=======\n", + " 82% 18/22 [00:01<00:00, 16.21it/s]\n", + "=======begin 18=======\n", + "question: 植物根系吸收后,在____被还原。\n", + "A. 叶肉细胞\n", + "B. 根组织\n", + "C. 植物体地上部\n", + "D. 根组织和叶肉细胞\n", + "答案是什么? \n", + "response: B\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 18=======\n", + "\n", + "=======begin 19=======\n", + "question: 在高温气候条件下用冷水灌溉植物往往会引起____。\n", + "A. 植物蒸腾加剧\n", + "B. 植物萎蔫\n", + "C. 植物快速吸水\n", + "D. 植物生长加速\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 19=======\n", + " 91% 20/22 [00:01<00:00, 16.36it/s]\n", + "=======begin 20=======\n", + "question: 高等植物花粉管向着胚珠的生长是____运动。\n", + "A. 向光性\n", + "B. 向化性\n", + "C. 向重性\n", + "D. 向触性\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 20=======\n", + "\n", + "=======begin 21=======\n", + "question: 下列属于生理碱性盐的是____。\n", + "A. $NaNO_3$\n", + "B. $(NH_4)_2SO_4$\n", + "C. $NH_4NO_3$\n", + "D. $MgSO_4$\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: A \n", + "\n", + "=======end 21=======\n", + "100% 22/22 [00:01<00:00, 16.43it/s]\n", + "Subject: plant_protection\n", + "Acc: 54.54545454545455\n", + "0.7115384615384616 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of legal_professional!\n", + " 0% 0/23 [00:00 \\lambda_B > \\lambda_C$,则____\n", + "A. 用入射光A照射金属b或c,金属b,c均可以发生光电效应现象\n", + "B. 用入射光A与B同时照射金属c,金属c可发生光电效应现象\n", + "C. 用入射光C照射金属a或b,金属a,b均可以发生光电效应现象\n", + "D. 用入射光B或C照射金属a,不可使金属a发生光电效应现象\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 15=======\n", + " 84% 16/19 [00:00<00:00, 16.13it/s]\n", + "=======begin 16=======\n", + "question: 提出“电场线”的科学家是____\n", + "A. 库仑\n", + "B. 法拉第\n", + "C. 麦克斯韦\n", + "D. 爱因斯坦\n", + "答案是什么? \n", + "response: A\n", + "ans: C\n", + "ground truth: B \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 关于必然质量的气体,以下有关气体的压强、体积、温度之间关系的说法正确的选项是____\n", + "A. 若是保持气体的体积不变,温度高升,压强减小\n", + "B. 若是保持气体的体积不变,温度降低,压强增大\n", + "C. 若是保持气体的温度不变,体积越小,压强越小\n", + "D. 若是保持气体的温度不变,体积越小,压强越大\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 17=======\n", + " 95% 18/19 [00:01<00:00, 16.18it/s]\n", + "=======begin 18=======\n", + "question: 在通常情况下固体分子间的平均距离为r0,分子间引力和斥力恰好大小相等,由此可以判定,在通常情况下____\n", + "A. 固体收缩时,分子间距增大,分子力表现为引力\n", + "B. 固体膨胀时,分子间距增大,分子力表现为斥力\n", + "C. 固体收缩时,分子间距减小,分子力表现为引力\n", + "D. 固体收缩时,分子间距减小,分子力表现为斥力\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 18=======\n", + "100% 19/19 [00:01<00:00, 16.28it/s]\n", + "Subject: high_school_physics\n", + "Acc: 26.31578947368421\n", + "0.75 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of ideological_and_moral_cultivation!\n", + " 0% 0/19 [00:00>d),单色光波长为λ,屏幕上相邻明条纹之间的距离为____\n", + "A. λD/d\n", + "B. λd/D\n", + "C. λD/(2d)\n", + "D. λd/(2D)\n", + "答案是什么? \n", + "response: 根据\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 7=======\n", + " 42% 8/19 [00:00<00:00, 16.07it/s]\n", + "=======begin 8=======\n", + "question: 关于电场强度定义式E=F/q0,下列说法中哪个是正确的?____\n", + "A. 场强E的大小与试探电荷q0的大小成反比\n", + "B. 对场中某点,试探电荷受力F与q0的比值不因q0而变\n", + "C. 试探电荷受力F的方向就是场强E的方向\n", + "D. 若场中某点不放试探电荷q0,则F=0,从而E=0\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 8=======\n", + "\n", + "=======begin 9=======\n", + "question: 设有一个带正电的导体球壳,当球壳内充满电介质、球壳外是真空时,球壳外一点的场强大小和电势用E1,U1表示;而球壳内、外均为真空时,壳外一点的场强大小和电势用E2,U2表示,则两种情况下壳外同一点处的场强大小和电势大小的关系为____\n", + "A. E1 = E2,U1 = U2\n", + "B. E1 = E2,U1 > U2\n", + "C. E1 > E2,U1 > U2\n", + "D. E1 < E2,U1 < U2\n", + "答案是什么? \n", + "response: 根据\n", + "ans: A\n", + "ground truth: A \n", + "\n", + "=======end 9=======\n", + " 53% 10/19 [00:00<00:00, 16.05it/s]\n", + "=======begin 10=======\n", + "question: 如果在空气平行板电容器的两极板间平行地插入一块与极板面积相同的金属板,则由于金属板的插入及其相对极板所放位置的不同,对电容器电容的影响为:____\n", + "A. 使电容减小,但与金属板相对极板的位置无关\n", + "B. 使电容减小,且与金属板相对极板的位置有关\n", + "C. 使电容增大,但与金属板相对极板的位置无关\n", + "D. 使电容增大,且与金属板相对极板的位置有关\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 10=======\n", + "\n", + "=======begin 11=======\n", + "question: 在波长为λ的驻波中两个相邻波节之间的距离为:____\n", + "A. λ\n", + "B. 3λ/4\n", + "C. λ/2\n", + "D. λ/4\n", + "答案是什么? \n", + "response: 根据\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 11=======\n", + " 63% 12/19 [00:00<00:00, 16.21it/s]\n", + "=======begin 12=======\n", + "question: 用强度为I,波长为λ的X射线(伦琴射线)分别照射锂(Z=3)和铁(Z=26).若在同一散射角下测得康普顿散射的X射线波长分别为λ_{Li}和λ_{Fe}(λ_{Li},λ_{Fe}>λ),它们对应的强度分别为I_{Li}和I_{Fe},则____\n", + "A. λ_{Li}>λ_{Fe}, I_{Li}I_{Fe}\n", + "D. λ_{Li}<λ_{Fe}, I_{Li}>I_{Fe}\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 12=======\n", + "\n", + "=======begin 13=======\n", + "question: 两个相距不太远的平面圆线圈,怎样可使其互感系数近似为零?____(设其中一线圈的轴线恰通过另一线圈的圆心)\n", + "A. 两线圈的轴线互相平行放置\n", + "B. 两线圈并联\n", + "C. 两线圈的轴线互相垂直放置\n", + "D. 两线圈串联\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 13=======\n", + " 74% 14/19 [00:00<00:00, 16.04it/s]\n", + "=======begin 14=======\n", + "question: 一特殊的轻弹簧,弹性力F=-kx3,k为一常量系数,x为伸长(或压缩)量。现将弹簧水平放置于光滑的水平面上,一端固定,一端与质量为m的滑块相连而处于自然长度状态。今沿弹簧长度方向给滑块一个冲量,使其获得一速度v,压缩弹簧,则弹簧被压缩的最大长度为____\n", + "A. \\sqrt{m/k}v\n", + "B. \\sqrt{k/m}v\n", + "C. (4mv/k)^(1/4)\n", + "D. (2mv^2/k)^(1/4)\n", + "答案是什么? \n", + "response: 根据\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 14=======\n", + "\n", + "=======begin 15=======\n", + "question: 一个圆锥摆的摆线长为l,摆线与竖直方向的夹角恒为θ,如图所示。则摆锤转动的周期为____\n", + "A. $\\sqrt{l/g}$\n", + "B. $\\sqrt{l\\cos θ/g}$\n", + "C. $2π\\sqrt{l/g}$\n", + "D. $2π\\sqrt{l\\cosθ/g}$\n", + "答案是什么? \n", + "response: 根据\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 15=======\n", + " 84% 16/19 [00:00<00:00, 16.04it/s]\n", + "=======begin 16=======\n", + "question: 把一平凸透镜放在平玻璃上,构成牛顿环装置.当平凸透镜慢慢地向上平移时,由反射光形成的牛顿环____\n", + "A. 向中心收缩,条纹间隔变小\n", + "B. 向中心收缩,环心呈明暗交替变化\n", + "C. 向外扩张,环心呈明暗交替变化\n", + "D. 向外扩张,条纹间隔变大\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 对于沿曲线运动的物体,以下几种说法中哪一种是正确的____\n", + "A. 切向加速度必不为零\n", + "B. 法向加速度必不为零(拐点处除外)\n", + "C. 由于速度沿切线方向,法向分速度必为零,因此法向加速度必为零\n", + "D. 若物体作匀速率运动,其总加速度必为零\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: B \n", + "\n", + "=======end 17=======\n", + " 95% 18/19 [00:01<00:00, 16.15it/s]\n", + "=======begin 18=======\n", + "question: 有三个直径相同的金属小球.小球1和小球2带等量异号电荷,两者的距离远大于小球直径,相互作用力为F.小球3不带电并装有绝缘手柄.用小球3先和小球1碰一下,接着又和小球2碰一下,然后移去.则此时小球1和2之间的相互作用力为____\n", + "A. 0\n", + "B. F/4\n", + "C. F/3\n", + "D. F/2\n", + "答案是什么? \n", + "response: 根据\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 18=======\n", + "100% 19/19 [00:01<00:00, 16.08it/s]\n", + "Subject: college_physics\n", + "Acc: 21.05263157894737\n", + "0.8269230769230769 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of discrete_mathematics!\n", + " 0% 0/16 [00:002\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 4=======\n", + "\n", + "=======begin 5=======\n", + "question: 六阶群的任何非平凡子群一定不是____\n", + "A. 2阶\n", + "B. 5阶\n", + "C. 3阶\n", + "D. 6阶\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 5=======\n", + " 38% 6/16 [00:00<00:00, 16.11it/s]\n", + "=======begin 6=======\n", + "question: 下面说法____是错误的\n", + "A. 不存在既自反又反自反的关系\n", + "B. 存在即对称又反对称的关系\n", + "C. 存在即不对称又不反对称的关系\n", + "D. 由一个有序对构成的二元关系一定是一个传递关系。\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 6=======\n", + "\n", + "=======begin 7=======\n", + "question: 设图G是有6个顶点的连通图,总度数为20,则从G中删去____边后使之变成树。\n", + "A. 10\n", + "B. 5\n", + "C. 3\n", + "D. 2\n", + "答案是什么? \n", + "response: 根据\n", + "ans: D\n", + "ground truth: B \n", + "\n", + "=======end 7=======\n", + " 50% 8/16 [00:00<00:00, 16.32it/s]\n", + "=======begin 8=======\n", + "question: 设G为平面图,则下面可能不正确的选项是_____\n", + "A. G = (G*)*\n", + "B. G*= ((G*)*)*\n", + "C. (G*)* = (G*)*)*)*\n", + "D. ((G*)*)*= ((((G*)*)*)*)*\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 8=======\n", + "\n", + "=======begin 9=======\n", + "question: 已知一棵树T中有度为5,4,3,2的顶点各一个,其余为树叶顶点,T的树叶顶点数为哪项?____\n", + "A. 8\n", + "B. 7\n", + "C. 6\n", + "D. 5\n", + "答案是什么? \n", + "response: 根据\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 9=======\n", + " 62% 10/16 [00:00<00:00, 16.33it/s]\n", + "=======begin 10=======\n", + "question: 含5个顶点、3条边的不同构的简单图有几个?____\n", + "A. 2\n", + "B. 3\n", + "C. 4\n", + "D. 5\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 10=======\n", + "\n", + "=======begin 11=======\n", + "question: 下面说法错误的是____\n", + "A. 邻接矩阵能表示自环,也能表示重边\n", + "B. 有向图邻接矩阵的第$i$行非零元的数目恰好是$\\nu_{i}$的正度。第$j$列非零元的数目是$\\nu_{j}$的负度\n", + "C. 关联矩阵能表示重边,不能表示自环\n", + "D. 有向图关联矩阵第$i$行中1 的数目是$\\nu_{i}$的正度,-1 的数目是$\\nu_{i}$的负度。\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 11=======\n", + " 75% 12/16 [00:00<00:00, 16.30it/s]\n", + "=======begin 12=======\n", + "question: 下式不一定成立的是____\n", + "A. $(\\forall x)(P(x)\\wedge Q(x))=(\\forall x)P(x)\\wedge(\\forall x)Q(x)$\n", + "B. $(\\exists x)(P(x)\\lor Q(x))=(\\exists x)P(x)\\lor(\\exists x)Q(x)$\n", + "C. $(\\forall x)(\\forall y)(P(x)\\lor{Q(x)}(y))=(\\forall x)P(x)\\lor(\\forall x)Q(x)$\n", + "D. $(\\exists x)(P(x)\\wedge Q(x))=(\\exists x)P(x)\\wedge(\\exists x)Q(x)$\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 12=======\n", + "\n", + "=======begin 13=======\n", + "question: 设A={2,3,4,6,9,12,18},A中的整除关系R是偏序关系,那么在偏序集中,下列说法正确的是____\n", + "A. A的最大元素是18;\n", + "B. A 的最小元素是2;\n", + "C. B= {3,9}的上界只有 18;\n", + "D. B= {4,9} 没有上下界。\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 13=======\n", + " 88% 14/16 [00:00<00:00, 16.17it/s]\n", + "=======begin 14=======\n", + "question: 设G为平面图,则下面可能不连通的图是____\n", + "A. G的闭合图\n", + "B. G*\n", + "C. (G*)*\n", + "D. (((G)*)*)*\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: A \n", + "\n", + "=======end 14=======\n", + "\n", + "=======begin 15=======\n", + "question: 一个无向图有五个结点,其中4个的度数是1,2,3,4,则第5个结点的度数不可能是____\n", + "A. 0\n", + "B. 2\n", + "C. 4\n", + "D. 5\n", + "答案是什么? \n", + "response: D\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 15=======\n", + "100% 16/16 [00:00<00:00, 16.28it/s]\n", + "Subject: discrete_mathematics\n", + "Acc: 43.75\n", + "0.8461538461538461 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of mao_zedong_thought!\n", + " 0% 0/24 [00:00\\frac{1}{2}\\mid Y=\\frac{1}{3}\\right)=$$____\n", + "A. $\\frac{3}{4}$\n", + "B. $\\frac{2}{3}$\n", + "C. $\\frac{1}{4}$\n", + "D. $\\frac{1}{3}$\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: A \n", + "\n", + "=======end 1=======\n", + " 11% 2/18 [00:00<00:01, 15.64it/s]\n", + "=======begin 2=======\n", + "question: 总体的简单样本,$\\bar{X}$为样本均值,则$D(\\bar{X})=$____\n", + "A. $\\frac{3}{80}$\n", + "B. $\\frac{9}{16}$\n", + "C. $\\frac{3}{1600}$\n", + "D. $\\frac{3}{160}$\n", + "答案是什么? \n", + "response: 这\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 2=======\n", + "\n", + "=======begin 3=======\n", + "question: 设总体$X$服从拉普拉斯分布$f(x,\\lambda)=\\frac{1}{4\\lambda}e^{-\\frac{|x|}{2\\lambda}},-\\infty0$。则$E(|X|)=$____\n", + "A. $\\frac{1}{2 \\lambda}$\n", + "B. $\\frac{1}{\\lambda}$\n", + "C. $2 \\lambda$\n", + "D. $\\lambda$\n", + "答案是什么? \n", + "response: 根据\n", + "ans: A\n", + "ground truth: C \n", + "\n", + "=======end 3=======\n", + " 22% 4/18 [00:00<00:00, 16.22it/s]\n", + "=======begin 4=======\n", + "question: 设$X_1,X_2,\\cdots X_{12}$是来自正态总体$X\\sim N\\left(0,\\sigma^2\\right)$的简单样本,随机变量$Y=\\frac{\\sum_{i=1}^6X_i^2}{\\sum_{j=1}^6X_{j+6}^2}$服从的分布为:____\n", + "A. $\\chi^2(6)$\n", + "B. $\\chi^2(1)$\n", + "C. $F(5,5)$\n", + "D. $F(6,6)$\n", + "答案是什么? \n", + "response: 根据\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 4=======\n", + "\n", + "=======begin 5=======\n", + "question: 对于任意两个随机变量X和$Y$,若$E(XY)=EX\\cdot EY$,则____\n", + "A. $D(X Y)=D(X) \\cdot D(Y)$\n", + "B. $D(X+Y)=D(X)+D(Y)$\n", + "C. X和Y独立\n", + "D. X和Y不相关\n", + "答案是什么? \n", + "response: A\n", + "ans: C\n", + "ground truth: D \n", + "\n", + "=======end 5=======\n", + " 33% 6/18 [00:00<00:00, 16.44it/s]\n", + "=======begin 6=======\n", + "question: 设$(X_1,X_2,...,X_n)$是取自总体X的一个样本,X的概率密度如下:$f(x)=\\begin{cases}\\frac12e^{-\\frac{(x-\\mu)}{2}},x\\geq\\mu,\\\\0,其他\\end{cases}$,$\\mu$为未知参数。则$\\mu$的最大似然估计量是.____\n", + "A. $\\hat{\\mu}=\\max _{1 \\leq i \\leq n} X_i$\n", + "B. $\\hat{\\mu}=\\frac13 \\max _{1 \\leq i \\leq n} X_i$\n", + "C. $\\hat{\\mu}=\\min _{1 \\leq i \\leq n} X_i$\n", + "D. $\\hat{\\mu}=\\frac12 \\min _{1 \\leq i \\leq n} X_i$\n", + "答案是什么? \n", + "response: B\n", + "ans: B\n", + "ground truth: C \n", + "\n", + "=======end 6=======\n", + "\n", + "=======begin 7=======\n", + "question: 当事件$A$和$B$同时发生时$C$也发生,则下列式子中成立的是____\n", + "A. $P(C)=P(A \\cap B)$\n", + "B. $P(C) \\leq P(A)+P(B)-1$\n", + "C. $P(C)=P(A \\cup B)$\n", + "D. $P(C) \\geq P(A)+P(B)-1$\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: D \n", + "\n", + "=======end 7=======\n", + " 44% 8/18 [00:00<00:00, 16.25it/s]\n", + "=======begin 8=======\n", + "question: $$\n", + "\\text{设}00$,$P(X_1^2+X_2^2+X_3^2+X_4^2\\le kX_5^2)=\\alpha$则k=____\n", + "A. $\\frac{1}{4}F_{\\alpha}(4,1)$\n", + "B. $\\frac{1}{4}F_{1-\\alpha}(4,1)$\n", + "C. $4F_{\\alpha}(4,1)$\n", + "D. $4F_{1-\\alpha}(4,1)$\n", + "答案是什么? \n", + "response: 根据\n", + "ans: D\n", + "ground truth: D \n", + "\n", + "=======end 15=======\n", + " 89% 16/18 [00:01<00:00, 15.69it/s]\n", + "=======begin 16=======\n", + "question: 设$X_1,X_1,\\cdots X_8$为来自总体$X\\sim N\\left(\\mu_1,1\\right)$的简单样本,$\\bar{X},S_1^2$分別是其对应的样本均值与样本方差。$Y_1,Y_1,\\cdots,Y_7$为来自总$Y\\sim N\\left(\\mu_2,1\\right)$的简单样本,$\\bar{Y},S_2^2$分别是其对应的样本均值与样本方差。下列选项正确的是:____\n", + "A. $\\sum_{i=1}^8\\left(X_i-\\mu_1\\right)^2+\\sum_{i=1}^7\\left(Y_i-\\mu_2\\right)^2 \\sim \\chi^2(15)$\n", + "B. $E\\left(\\sum_{i=1}^8\\left(X_i-\\mu_1\\right)^2+\\sum_{i=1}^7\\left(Y_i-\\mu_2\\right)^2\\right)=15$\n", + "C. $\\mathrm{D}(\\bar{X}+\\bar{Y})=\\frac{1}{8}+\\frac{1}{7}$\n", + "D. $\\bar{X}-\\bar{Y} \\sim \\mathrm{N}\\left(\\mu_1-\\mu_2, \\frac{1}{8}+\\frac{1}{7}\\right)$\n", + "答案是什么? \n", + "response: A\n", + "ans: A\n", + "ground truth: B \n", + "\n", + "=======end 16=======\n", + "\n", + "=======begin 17=======\n", + "question: 若随机变量X的分布函数为$F(x)=pF_1(x)+qF_2(x)$,其中$F_1(x)$,$F_2(x)$为两个分布函数,常数p,q满足:$p>0$,$q>0$,$p+q=1$,那么X的分布叫作$F_1(x),F_2(x)$的混合分布.设$\\mu_1,\\mu_2$分别为$F_1(x),F_2(x)$的期望,$\\sigma_1^2,\\sigma_2^2$分别为$F_1(\\mathrm{x})$,$F_2(\\mathrm{x})$的方差,则$DX=$____\n", + "A. $p \\sigma_1^2+q \\sigma_2^2$\n", + "B. $p^2 \\sigma_1^2+q^2 \\sigma_2^2$\n", + "C. $p \\sigma_1^2+q \\sigma_2^2+p q\\left(\\mu_1-\\mu_2\\right)^2$\n", + "D. $p \\sigma_1^2+q \\sigma_2^2+p q\\left(\\sigma_1-\\sigma_2\\right)^2$\n", + "答案是什么? \n", + "response: A\n", + "ans: D\n", + "ground truth: C \n", + "\n", + "=======end 17=======\n", + "100% 18/18 [00:01<00:00, 15.90it/s]\n", + "Subject: probability_and_statistics\n", + "Acc: 22.22222222222222\n", + "0.9615384615384616 Inference starts at 2023-06-16_00-47-07 on /content/alpaca-combined-hf with subject of high_school_chinese!\n", + " 0% 0/19 [00:00