500条数据训练完全无效果
#6219
Replies: 1 comment 1 reply
-
你这数据集格式对吗?我只用过alpaca和sharegpt格式的数据集,有效果。 |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
不知为何训练数据足够但是效果差几乎没有起到任何影响,求大佬指教!
System Info
llamafactory
version: 0.9.1.dev0以下是我的一些参数和数据集内容
![image](https://private-user-images.githubusercontent.com/106812293/391313785-2b466735-91cf-4cfc-8c1f-41712f3d369c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNTE4NTgsIm5iZiI6MTczOTM1MTU1OCwicGF0aCI6Ii8xMDY4MTIyOTMvMzkxMzEzNzg1LTJiNDY2NzM1LTkxY2YtNGNmYy04YzFmLTQxNzEyZjNkMzY5Yy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEyJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMlQwOTEyMzhaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lMGY0YjNmOGVmMDZjYTFlZDMyNjE2ZTM5ZTQ3MzY3OGJmOTk1NDNiNzRiOTZiMDJmYzAxYjg5ODk0NjhjY2E0JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.x0Qs5FB1DA7s5FRMUa1DEdM0iXp3TCkaUqRatNo2FOc)
![image](https://private-user-images.githubusercontent.com/106812293/391313808-9ee35846-9c2b-4aaa-8393-13a9bb03855a.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNTE4NTgsIm5iZiI6MTczOTM1MTU1OCwicGF0aCI6Ii8xMDY4MTIyOTMvMzkxMzEzODA4LTllZTM1ODQ2LTljMmItNGFhYS04MzkzLTEzYTliYjAzODU1YS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEyJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMlQwOTEyMzhaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT05YjZmNGE4OTIxNjI4NjEzNDBjZWFhZGZjNTE5YjNmMmNhYWFmNGEyMzdlNjM0ZThjZjlhNDVjOGE0MDkwOTk1JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.JRLMGKIpoK8QeL06TQ4CLLBeomO3gXnp-483UYnecck)
![image](https://private-user-images.githubusercontent.com/106812293/391313823-85f27fdb-172c-4379-91b2-1efbdb0e1114.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNTE4NTgsIm5iZiI6MTczOTM1MTU1OCwicGF0aCI6Ii8xMDY4MTIyOTMvMzkxMzEzODIzLTg1ZjI3ZmRiLTE3MmMtNDM3OS05MWIyLTFlZmJkYjBlMTExNC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEyJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMlQwOTEyMzhaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1iYTNhMTU0Yzg2NGM5ZGExODcwNGI1OGEzNmE1OWJhMThhMGQzOWUyN2E5Yzc0OTQ3YzM4M2E5NmY5NDdiMTFhJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.9Kvnm6VIZ_Q2SwapGlMstjJ-mvzdWs5lw3-nSin9E3s)
![losses](https://private-user-images.githubusercontent.com/106812293/391314313-5e37053c-b25f-43c5-86d2-3a0c390e4d22.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNTE4NTgsIm5iZiI6MTczOTM1MTU1OCwicGF0aCI6Ii8xMDY4MTIyOTMvMzkxMzE0MzEzLTVlMzcwNTNjLWIyNWYtNDNjNS04NmQyLTNhMGMzOTBlNGQyMi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEyJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMlQwOTEyMzhaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kMDA3ZDAzMzA0NDFmZDc0Y2I1MzIwMDFhMjg5YTkwZjdhOTYyZjlhMDc2NDcwNWYwNmZmZTM4MjE3NDUwZGEyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.pMqpaCPUooG_HSAnpRla5arP4dToWr-00GKyXXnkXs8)
以下是我的训练loss图
训练脚本:
在我发讨论的时候GPU资源不足了,而我又没有留存截图,只能大致描述一下,实在抱歉!
我确定勾选了训练好的检查点路径,使用huggingface方式加载了模型和检查点,但是效果和没训练一样。我又试了
identity.json
,并勾选了训练好的检查点路径,使用huggingface方式加载了模型和检查点。但结果还是一样的,模型依旧回答自己是千问大模型。请问大佬,这种是什么问题呢?是数据集问题还是参数问题,抑或是我的打开方式不正确?小白求教🙏
Beta Was this translation helpful? Give feedback.
All reactions