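Describe the bug
The TFLOPS figure in the InternEvo code is currently computed directly from the formula. However, when TP or PP is used, the model is split across devices, which makes the reported TFLOPS inaccurate.
Environment
Official image code
Other information
No response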
@li126com could you please take a look?
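The TFLOPS we compute here is the aggregate figure, not a per-GPU one, so TP and PP do not need to be taken into account. Similarly, Megatron does not need to consider them when computing total_flops.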
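
To illustrate the point above, here is a minimal sketch of why TP and PP drop out of an aggregate FLOPs count. It uses the commonly cited Megatron-LM estimate for a GPT-style transformer (96 * B * s * l * h^2 * (1 + s/(6h) + V/(16lh)) with activation recomputation, 72 in place of 96 without it). This is an assumption for illustration and is not necessarily the exact formula InternEvo uses; the function names below are hypothetical.

```python
def total_train_flops(global_batch_size, seq_len, num_layers, hidden_size,
                      vocab_size, recompute=True):
    """Estimated FLOPs for one training step of the whole model.

    Assumption: Megatron-LM style estimate for a GPT-like transformer.
    Note that no TP or PP degree appears here: splitting the model
    changes where the work runs, not how much work there is.
    """
    # 96 = forward + backward + recomputed forward; 72 without recomputation.
    factor = 96 if recompute else 72
    return (
        factor * global_batch_size * seq_len * num_layers * hidden_size ** 2
        * (1
           + seq_len / (6 * hidden_size)
           + vocab_size / (16 * num_layers * hidden_size))
    )


def aggregate_tflops(flops, step_time_s):
    """Aggregate throughput over all GPUs, in TFLOPS."""
    return flops / step_time_s / 1e12


def per_gpu_tflops(flops, step_time_s, dp, tp, pp):
    """Per-GPU throughput: divide the aggregate by the world size."""
    world_size = dp * tp * pp  # hypothetical layout: data x tensor x pipeline
    return aggregate_tflops(flops, step_time_s) / world_size


if __name__ == "__main__":
    # Illustrative model size and step time only, not measured values.
    flops = total_train_flops(global_batch_size=512, seq_len=4096,
                              num_layers=32, hidden_size=4096,
                              vocab_size=100_000)
    print("aggregate TFLOPS:", aggregate_tflops(flops, step_time_s=10.0))
    print("per-GPU TFLOPS:", per_gpu_tflops(flops, 10.0, dp=8, tp=4, pp=2))
```

In this sketch the parallel layout only enters when the aggregate figure is converted to a per-GPU one, through the world size `dp * tp * pp`.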