大佬有没有兴趣再搞一搞llama3-8b
1
#24 opened 7 months ago
by
AlexLee01
Hello
#23 opened 8 months ago
by
huangfeilong
model = model.half().quantize(4).cuda() 运行显示错误"AttributeError: 'Linear' object has no attribute 'bias'"
1
#22 opened 8 months ago
by
Frank1983823
请问训练所用的数据集能否公开?
1
#21 opened 8 months ago
by
sssssimeng
请问作者,rlhf的actor loss是否下降和正常收敛呢?能不能给一些经验的超参数设置?请教了
4
#20 opened 12 months ago
by
hepansls
Cannot copy out of meta tensor; no data!,报错代码地方为model = model.half().quantize(4).cuda() ,猜测是量化相关问题或者作者的模型上传的时候有遗漏的文件
4
#18 opened about 1 year ago
by
yoma0101
关于两种加载模型文件方式的区别
1
#17 opened about 1 year ago
by
rk686
如何在多gpu上加载
1
#16 opened about 1 year ago
by
jersonal
Lora和RLHF训练的代码开源了吗
9
#14 opened about 1 year ago
by
tenghg
还可以再次进行自我认知的lora的训练吗
2
#11 opened about 1 year ago
by
goodboys
api 调用您的模型出现错误
5
#10 opened over 1 year ago
by
neteasy
牛的 无限长的原理和chatglm2是一个道理吗?
3
#9 opened over 1 year ago
by
szu2018chenli
测试了一下,很好用,比chatglm2还好用。期待更多作品。
1
#8 opened over 1 year ago
by
alfgo
训练代码
1
#6 opened over 1 year ago
by
xiazhentao
启动加载很慢,需要130秒
4
#5 opened over 1 year ago
by
devillaws
的确很好用!!
1
#4 opened over 1 year ago
by
lii1314520
会不会基于ChatGLM2-6B进行迭代?
3
#3 opened over 1 year ago
by
neteasy