-
Notifications
You must be signed in to change notification settings - Fork 85
Issues: Lightning-AI/lightning-thunder
Label tracking meta-issue (edit me to get automatically CC'ed...
#72
opened Mar 25, 2024 by
carmocca
Open
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
thunder.jit
has a relatively high CPU overhead when processing small graphs with small inputs.
performance
#1657
opened Jan 17, 2025 by
kiya00
nvFuser using more memory than inductor for HF CausalLMLoss
memory use
nvfuser
#1654
opened Jan 17, 2025 by
riccardofelluga
Utility for measuring the CPU and GPU times of fusion regions for a particular backend in a model
enhancement
New feature or request
#1638
opened Jan 10, 2025 by
kevinstephano
module with buffer requires wrapper module to avoid
jit_ext
error
#1637
opened Jan 10, 2025 by
ali-alshaar7
Show return values New feature or request
type_str
in printing unpack_sequence (to have the information in backward trace like forward trace)
enhancement
#1635
opened Jan 10, 2025 by
crcrpar
backward creates inconsistent proxies between args and unpacking them
autograd
tracing architecture
#1633
opened Jan 10, 2025 by
t-vi
avoid joint trace in rematerialize forward backward
rematerialization
#1618
opened Jan 8, 2025 by
t-vi
make traces own proxies and bsyms
enhancement
New feature or request
tracing architecture
#1606
opened Jan 6, 2025 by
t-vi
Default value for TensorProxy's requires_grad argument is invalid
#1594
opened Dec 30, 2024 by
IvanYashchuk
Allow Proxy creation without active TraceCtx
enhancement
New feature or request
tracing architecture
#1593
opened Dec 30, 2024 by
IvanYashchuk
nvFuser has a faster RMSNorm fusion definition than thunder's RMSNorm decomposition
operators
performance
#1582
opened Dec 23, 2024 by
mruberry
Get dynamic shapes to work with Phi-3-mini-128k-instruct
enhancement
New feature or request
nemo
Issues needed to support NVIDIA NeMo models.
#1579
opened Dec 20, 2024 by
tfogal
Consider adding is_leaf attribute to TensorProxies
enhancement
New feature or request
#1577
opened Dec 20, 2024 by
beverlylytle
thunderfx : detecting parameters and buffers on thunderfx path
jit
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1575
opened Dec 19, 2024 by
kshitij12345
Strides of 2D column major Tensor seem to be unexpectedly changed
#1572
opened Dec 19, 2024 by
crcrpar
"requires_grad" attribute on intermediate TensorProxies is unused and misleading
autograd
developer efficiency
#1570
opened Dec 18, 2024 by
IvanYashchuk
Add custom logsigmoid grad for PyTorch executor
autograd
operators
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1555
opened Dec 13, 2024 by
mruberry
Investigate Memory and Performance difference using Issues needed to support NVIDIA NeMo models.
performance
thunderfx
for things that could be applicable to the dynamo+thunder frontend
nvfuser
vs torch.compile
executor on Qwen2
high priority
memory use
nemo
#1552
opened Dec 13, 2024 by
kshitij12345
Feature: Provide a mechanism for practitioners to select different executors per FX graph when using ThunderFX
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1550
opened Dec 12, 2024 by
mruberry
UX: Don't validate tensor metadata for parameter tensors by default
performance
thunderfx
for things that could be applicable to the dynamo+thunder frontend
ux
#1542
opened Dec 11, 2024 by
mruberry
decomposition for torch.minimum, torch.maximum
nvfuser
operators
program-coverage
Requests for model and program coverage
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1537
opened Dec 10, 2024 by
t-vi
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.