Total Trainable param #346

Guardian-in-the-WF · 2024-11-29T09:37:09Z

Why does initializing the Otter model report "Total Trainable param: 0.000000 B" after I create the accelerator like this?

    accelerator = Accelerator(
        gradient_accumulation_steps=args.gradient_accumulation_steps,
        mixed_precision="no",  # originally "bf16"
    )

I ran this code on an RTX A6000 machine.
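For anyone trying to reproduce this, here is a minimal sketch of how such a count is commonly computed. It assumes the printed figure is the number of elements in parameters whose `requires_grad` is true, divided by 1e9. That is an assumption about the training script, not something confirmed in this issue. `FakeParam`, `trainable_report`, `format_billions`, and the parameter names are hypothetical stand-ins so the snippet runs without PyTorch.

```python
from dataclasses import dataclass


@dataclass
class FakeParam:
    """Hypothetical stand-in for torch.nn.Parameter (only the members used here)."""
    size: int
    requires_grad: bool

    def numel(self) -> int:
        return self.size


def trainable_report(named_params):
    """Return (trainable_count, total_count, names_of_frozen_params)."""
    trainable, total, frozen = 0, 0, []
    for name, p in named_params:
        n = p.numel()
        total += n
        if p.requires_grad:
            trainable += n
        else:
            frozen.append(name)
    return trainable, total, frozen


def format_billions(count):
    """Format a raw element count in billions with six decimals."""
    return f"{count / 1e9:.6f} B"


# Illustrative data: every parameter is frozen, so the trainable count is zero.
params = [
    ("vision_encoder.weight", FakeParam(1_000_000, False)),
    ("lang_encoder.weight", FakeParam(2_000_000, False)),
]
trainable, total, frozen = trainable_report(params)
print("Total Trainable param:", format_billions(trainable))
print("Total param:", format_billions(total))
print("Frozen:", frozen)
```

With a real PyTorch model the same function can be called as `trainable_report(model.named_parameters())`, since `torch.nn.Parameter` exposes both `numel()` and `requires_grad`. Comparing the trainable count with the total, and listing the frozen names, may help show whether every parameter was frozen at the point where the message is printed.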
