微调Qwen1.5-7B-Chat模型测试记录 #65
Valdanitooooo
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
微调的目标设定
环境准备
硬件
一块16GB显存的nvidia显卡
cuda
cuda版本和加下来多个python库的安装息息相关,选择时要先做好调研
略
https://developer.nvidia.com/cuda-toolkit-archive
笔者这里选择12.4.0
LLaMA-Factory
https://github.com/hiyouga/LLaMA-Factory#dependence-installation
flash-attn
可选,发现微调后用flash-attn加载有时会报错
autoawq
用的awq量化模型所以要装这个,如果用gptq的模型就装
auto-gptq
启动LLaMA-Factory的 train_web 服务
成功后自动跳转到浏览器 http://localhost:7860/
微调数据集准备
自我认知数据集
准备一个json文件
self-awareness.json
,放到LLaMA-Factory/data/
目录中self-awareness.json
内容:更新dataset_info.json文件
在
LLaMA-Factory/data/dataset_info.json
后面中加一条准备好后刷新
刷新 http://localhost:7860/ 页面即可看到新的数据集
点击预览
![image](https://private-user-images.githubusercontent.com/26271661/325578830-9213df22-313b-403f-ae56-332e02e3228d.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1Nzg4MzAtOTIxM2RmMjItMzEzYi00MDNmLWFlNTYtMzMyZTAyZTMyMjhkLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTkwMzBlZGFkNjFmNzgzOTgwOTA0Y2UyM2E3MmVjNjdiZjFmNDFmMzgxZTBjYjRhZjQ4NjNjZWQxZGQ1MWZhMjgmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.H1JwWOi09-8D0icwr52-NOBNp0vWlx2By5Gpy6aAV4o)
微调前测试
如图,在chat页加载模型后进行对话
![image](https://private-user-images.githubusercontent.com/26271661/325559888-f2881c82-4007-498c-aec3-ae8cb45dd398.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1NTk4ODgtZjI4ODFjODItNDAwNy00OThjLWFlYzMtYWU4Y2I0NWRkMzk4LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWI0YjgxYTBkMmRkNGUwYTk2M2E5NzMwOTg3YTI4MTZiMWU0NTk4MTIyYTBhZDM0MGQ4MzBhMDMxNGZmYmE2NTkmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.S20EHnOOYLQvWDeJY_Z74lejfHEfM0gi0ThGpZ-IRyc)
微调
修改的几处配置
后来把训练轮数改成了200 😆
![2024-04-25_19-11-29](https://private-user-images.githubusercontent.com/26271661/325586321-cd885ada-28cc-4ec7-8df7-c07386558823.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1ODYzMjEtY2Q4ODVhZGEtMjhjYy00ZWM3LThkZjctYzA3Mzg2NTU4ODIzLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWNiN2I5ODQ2OGFkMjY2ZWZkMzljMzVjMTU3NmMwOTNhZjgyN2M0NWNhMDMwYmEyZjQ4NTZkNDExZDg0Y2Y3NGMmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.gYvXHSDkbx95YO0iUbN7vEJcSTyw1-ZBe07qbJHxi_I)
![2024-04-25_19-11-58](https://private-user-images.githubusercontent.com/26271661/325586389-ecdf763e-f0d8-4658-8b4c-5f297b9f1d32.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1ODYzODktZWNkZjc2M2UtZjBkOC00NjU4LThiNGMtNWYyOTdiOWYxZDMyLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTFhODFmYjBmZjJkZWI3ZWJkOWRiZTcxZjFiNTcwZjRlYzJkNjc3OWMzOTdmYWNkNzQxY2VlOGQyZThjMWMwNDImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.NQrTWvDUp8ySmfMMQRaxqSdgN7jTz15g-8L92vAt35E)
预览命令
开始
微调过程显存占用不算高
![2024-04-25_19-13-15](https://private-user-images.githubusercontent.com/26271661/325586514-636baa82-b21e-4acc-89e1-53ac031a70e8.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1ODY1MTQtNjM2YmFhODItYjIxZS00YWNjLTg5ZTEtNTNhYzAzMWE3MGU4LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTkyNWI1ZDQ4NDEwMzVkY2M0YTk4NWUyZDIxOTk0NDRlOGI5MmM2MzJmNmE0MGNhYmY4OTk0ZDJmY2RkYTIxMDUmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.OvOQv9ZI3wQUPn49-6UtSgHRKnGbXgreOXrF4Alrtuo)
一不留神就结束了
微调后测试
切到Chat页
![image](https://private-user-images.githubusercontent.com/26271661/325589778-17a2f2cd-3653-4736-a928-0e7b58e46e90.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1ODk3NzgtMTdhMmYyY2QtMzY1My00NzM2LWE5MjgtMGU3YjU4ZTQ2ZTkwLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWFmZDljMmE4MGFhMzNkZGNhZGM2OWYxYjk5NmYzYWE2ZGZmMzUyNzMwMTVmMGM0MDRlODcxMDdiZDBjMGY0YzEmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.lhUn1XLmBPB89e_F_g4JML-j6uk9Iz5h3oAN8OvN6cQ)
![image](https://private-user-images.githubusercontent.com/26271661/325590868-3d5218ba-349f-4995-b243-f3b1f23e3e70.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1OTA4NjgtM2Q1MjE4YmEtMzQ5Zi00OTk1LWIyNDMtZjNiMWYyM2UzZTcwLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWMxNTdiZjBhMTQ3MGMwYjcyOTIxMmRkM2MwNzJjZTA4Njg4ZDkyN2U0YzU0NzM2MmQwYWMxMWQ0YzMwNGJhYTkmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.vEKgSK83xPHpTX1ZoXLbCGr8wYxGWfxxiuarrEdyHGM)
刷新适配器,选择self-awareness,测试结果符合预期
导出模型
切到Export页
导出报错,符合预期
![image](https://private-user-images.githubusercontent.com/26271661/325592306-9a1e1edc-5b76-44f3-8cac-07105eee6edd.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1OTIzMDYtOWExZTFlZGMtNWI3Ni00NGYzLThjYWMtMDcxMDVlZWU2ZWRkLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTRmZGZiY2VlMDAyYTJlOGU4ZDczOTRhNjk3MGNmNDM0ZjZjNDdjNDY3YzdjNmMyMDc4MDk3ZjNkOWMzZTZjNWImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.I1sszKzywx8ctqxhp_pqELy93YUeCgnzvABx9ZYYhMQ)
把这里改成非量化的模型地址就可以啦
![image](https://private-user-images.githubusercontent.com/26271661/325593576-4f73299e-314d-4492-a67a-1da7f6235359.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMTAwODQsIm5iZiI6MTczOTMwOTc4NCwicGF0aCI6Ii8yNjI3MTY2MS8zMjU1OTM1NzYtNGY3MzI5OWUtMzE0ZC00NDkyLWE2N2EtMWRhN2Y2MjM1MzU5LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDIxMzYyNFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTUwZmYxMDc4ZTlkZjFjMTM4Njg3ZjdmZTUxNTJiMTRhMzdmNTYyODYxNjEyY2EyOGJhMTVmYjUyOTMzNGJkYTImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.59riFK4fwp2YLisHXQl9uiMoYlKmZDmvEOv69C38o0Y)
再测一下训练好的模型
研发团队不符合预期
修改参数重新微调
把训练轮数改成了200,测试结果符合预期
训练时间大概五分钟
Beta Was this translation helpful? Give feedback.
All reactions