-
Notifications
You must be signed in to change notification settings - Fork 41
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
MLPerf default config version with bert & 3d-unet benchmarks (#392)
* Using CUDA 12.4 & MLPerf Inference results 4.1. * MLPerf metrics parser using metadata.json from inference results v4.1. * Passing unit test, adding 3d-unet benchmark. * Passing functional test. * Passing CUDAAndNVIDIAGPUDriverInstallationTest test cases. * Passing NvidiaContainerToolKitInstallationTests cases. * Documentation. * String interpolation for the parser. * Documentation updates. * Changing expected latency for 3d-unet single stream. * Review changes. * Documentation for adding custom configs information. * Documentation fix. * Increment VERSION. * Updating parser to have simpler names for metrics. * Fixing a typo. * Updating documentation with new metric names. --------- Co-authored-by: saibulusu <[email protected]>
- Loading branch information
Showing
40 changed files
with
3,256 additions
and
6,794 deletions.
There are no files selected for viewing
25 changes: 22 additions & 3 deletions
25
src/VirtualClient/TestResources/Results_MLPerf_Accuracy_Summary.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,24 @@ | ||
{ | ||
"A100-PCIe-80GBx4_TRT-custom_k_99_9_MaxP-Server": { "bert": "Accuracy = 91.873, Threshold = 90.783. Accuracy test PASSED." }, | ||
"A100-PCIe-80GBx4_TRT-custom_k_99_9_MaxP-SingleStream": { "bert": "Accuracy = 91.568, Threshold = 90.783. Accuracy test PASSED." }, | ||
"A100-PCIe-80GBx4_TRT-custom_k_99_9_MaxP-Offline": { "bert": "Accuracy = 91.832, Threshold = 90.783. Accuracy test FAILED." } | ||
"accuracy": [ | ||
{ | ||
"name": "F1", | ||
"pass": true, | ||
"threshold": 89.96526, | ||
"value": 90.2147015680108 | ||
} | ||
], | ||
"accuracy_pass": true, | ||
"benchmark_full": "bert-99", | ||
"benchmark_short": "bert", | ||
"config_name": "DGX-A100_A100-SXM4-40GBx8_TRT-custom_k_99_MaxP-Offline", | ||
"detected_system": "SystemConfiguration(host_cpu_conf=CPUConfiguration(layout={CPU(name=\"AMD EPYC 7V12 64-Core Processor\", architecture=CPUArchitecture.x86_64, core_count=48, threads_per_core=1): 2}), host_mem_conf=MemoryConfiguration(host_memory_capacity=Memory(quantity=928.7656999999999, byte_suffix=ByteSuffix.GB), comparison_tolerance=0.05), accelerator_conf=AcceleratorConfiguration(layout={GPU(name=\"NVIDIA A100-SXM4-40GB\", accelerator_type=AcceleratorType.Discrete, vram=Memory(quantity=40.0, byte_suffix=ByteSuffix.GiB), max_power_limit=400.0, pci_id=\"0x20B010DE\", compute_sm=80): 8}), numa_conf=NUMAConfiguration(numa_nodes={}, num_numa_nodes=4), system_id=\"DGX-A100_A100-SXM4-40GBx8\")", | ||
"effective_min_duration_ms": 600000, | ||
"effective_samples_per_query": 19800000, | ||
"satisfies_query_constraint": true, | ||
"scenario": "Offline", | ||
"scenario_key": "result_samples_per_second", | ||
"summary_string": "[PASSED] F1: 90.215 (Threshold=89.965)", | ||
"system_name": "DGX-A100_A100-SXM4-40GBx8_TRT", | ||
"tensorrt_version": "10.2.0", | ||
"test_mode": "AccuracyOnly" | ||
} |
18 changes: 16 additions & 2 deletions
18
src/VirtualClient/TestResources/Results_MLPerf_Harness_Summary.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,18 @@ | ||
{ | ||
"A100-PCIe-80GBx4_TRT_Triton-triton_k_99_9_MaxP-Server": { "bert": "result_scheduled_samples_per_sec: 4751.78, Result is INVALID" }, | ||
"A100-PCIe-80GBx4_TRT_Triton-triton_k_99_9_MaxP-SingleStream": { "bert": "result_90.00_percentile_latency_ns: 2202969, Result is VALID" } | ||
"benchmark_full": "bert-99", | ||
"benchmark_short": "bert", | ||
"config_name": "DGX-A100_A100-SXM4-40GBx8_TRT-custom_k_99_MaxP-Server", | ||
"detected_system": "SystemConfiguration(host_cpu_conf=CPUConfiguration(layout={CPU(name=\"AMD EPYC 7V12 64-Core Processor\", architecture=CPUArchitecture.x86_64, core_count=48, threads_per_core=1): 2}), host_mem_conf=MemoryConfiguration(host_memory_capacity=Memory(quantity=928.7656999999999, byte_suffix=ByteSuffix.GB), comparison_tolerance=0.05), accelerator_conf=AcceleratorConfiguration(layout={GPU(name=\"NVIDIA A100-SXM4-40GB\", accelerator_type=AcceleratorType.Discrete, vram=Memory(quantity=40.0, byte_suffix=ByteSuffix.GiB), max_power_limit=400.0, pci_id=\"0x20B010DE\", compute_sm=80): 8}), numa_conf=NUMAConfiguration(numa_nodes={}, num_numa_nodes=4), system_id=\"DGX-A100_A100-SXM4-40GBx8\")", | ||
"early_stopping_met": true, | ||
"effective_min_duration_ms": 600000, | ||
"effective_min_query_count": 100, | ||
"result_completed_samples_per_sec": 25405.6, | ||
"result_validity": "VALID", | ||
"satisfies_query_constraint": false, | ||
"scenario": "Server", | ||
"scenario_key": "result_completed_samples_per_sec", | ||
"summary_string": "result_completed_samples_per_sec: 25405.6, Result is VALID, 10-min runtime requirement met: True", | ||
"system_name": "DGX-A100_A100-SXM4-40GBx8_TRT", | ||
"tensorrt_version": "10.2.0", | ||
"test_mode": "PerformanceOnly" | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
24 changes: 24 additions & 0 deletions
24
...tualClient/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_accuracy_summary1.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
{ | ||
"accuracy": [ | ||
{ | ||
"name": "F1", | ||
"pass": true, | ||
"threshold": 89.96526, | ||
"value": 90.2147015680108 | ||
} | ||
], | ||
"accuracy_pass": true, | ||
"benchmark_full": "bert-99", | ||
"benchmark_short": "bert", | ||
"config_name": "DGX-A100_A100-SXM4-40GBx8_TRT-custom_k_99_MaxP-Offline", | ||
"detected_system": "SystemConfiguration(host_cpu_conf=CPUConfiguration(layout={CPU(name=\"AMD EPYC 7V12 64-Core Processor\", architecture=CPUArchitecture.x86_64, core_count=48, threads_per_core=1): 2}), host_mem_conf=MemoryConfiguration(host_memory_capacity=Memory(quantity=928.7656999999999, byte_suffix=ByteSuffix.GB), comparison_tolerance=0.05), accelerator_conf=AcceleratorConfiguration(layout={GPU(name=\"NVIDIA A100-SXM4-40GB\", accelerator_type=AcceleratorType.Discrete, vram=Memory(quantity=40.0, byte_suffix=ByteSuffix.GiB), max_power_limit=400.0, pci_id=\"0x20B010DE\", compute_sm=80): 8}), numa_conf=NUMAConfiguration(numa_nodes={}, num_numa_nodes=4), system_id=\"DGX-A100_A100-SXM4-40GBx8\")", | ||
"effective_min_duration_ms": 600000, | ||
"effective_samples_per_query": 19800000, | ||
"satisfies_query_constraint": true, | ||
"scenario": "Offline", | ||
"scenario_key": "result_samples_per_second", | ||
"summary_string": "[PASSED] F1: 90.215 (Threshold=89.965)", | ||
"system_name": "DGX-A100_A100-SXM4-40GBx8_TRT", | ||
"tensorrt_version": "10.2.0", | ||
"test_mode": "AccuracyOnly" | ||
} |
24 changes: 24 additions & 0 deletions
24
...tualClient/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_accuracy_summary2.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
{ | ||
"accuracy": [ | ||
{ | ||
"name": "F1", | ||
"pass": false, | ||
"threshold": 1.0, | ||
"value": 1.5 | ||
} | ||
], | ||
"accuracy_pass": true, | ||
"benchmark_full": "bert-99", | ||
"benchmark_short": "bert", | ||
"config_name": "DGX-A100_A100-SXM4-40GBx8_TRT-custom_k_99_MaxP-Offline", | ||
"detected_system": "SystemConfiguration(host_cpu_conf=CPUConfiguration(layout={CPU(name=\"AMD EPYC 7V12 64-Core Processor\", architecture=CPUArchitecture.x86_64, core_count=48, threads_per_core=1): 2}), host_mem_conf=MemoryConfiguration(host_memory_capacity=Memory(quantity=928.7656999999999, byte_suffix=ByteSuffix.GB), comparison_tolerance=0.05), accelerator_conf=AcceleratorConfiguration(layout={GPU(name=\"NVIDIA A100-SXM4-40GB\", accelerator_type=AcceleratorType.Discrete, vram=Memory(quantity=40.0, byte_suffix=ByteSuffix.GiB), max_power_limit=400.0, pci_id=\"0x20B010DE\", compute_sm=80): 8}), numa_conf=NUMAConfiguration(numa_nodes={}, num_numa_nodes=4), system_id=\"DGX-A100_A100-SXM4-40GBx8\")", | ||
"effective_min_duration_ms": 600000, | ||
"effective_samples_per_query": 19800000, | ||
"satisfies_query_constraint": true, | ||
"scenario": "Offline", | ||
"scenario_key": "result_samples_per_second", | ||
"summary_string": "[FAILED] F1: 90.215 (Threshold=89.965)", | ||
"system_name": "DGX-A100_A100-SXM4-40GBx8_TRT", | ||
"tensorrt_version": "10.2.0", | ||
"test_mode": "AccuracyOnly" | ||
} |
5 changes: 0 additions & 5 deletions
5
...lient/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_bert_accuracy_summary1.json
This file was deleted.
Oops, something went wrong.
4 changes: 0 additions & 4 deletions
4
...lient/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_bert_accuracy_summary2.json
This file was deleted.
Oops, something went wrong.
4 changes: 0 additions & 4 deletions
4
...t/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_bert_perf_harness_summary1.json
This file was deleted.
Oops, something went wrong.
5 changes: 0 additions & 5 deletions
5
...t/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_bert_perf_harness_summary2.json
This file was deleted.
Oops, something went wrong.
18 changes: 18 additions & 0 deletions
18
...lClient/VirtualClient.Actions.UnitTests/Examples/MLPerf/Example_performance_summary1.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
{ | ||
"benchmark_full": "bert-99", | ||
"benchmark_short": "bert", | ||
"config_name": "DGX-A100_A100-SXM4-40GBx8_TRT-custom_k_99_MaxP-Server", | ||
"detected_system": "SystemConfiguration(host_cpu_conf=CPUConfiguration(layout={CPU(name=\"AMD EPYC 7V12 64-Core Processor\", architecture=CPUArchitecture.x86_64, core_count=48, threads_per_core=1): 2}), host_mem_conf=MemoryConfiguration(host_memory_capacity=Memory(quantity=928.7656999999999, byte_suffix=ByteSuffix.GB), comparison_tolerance=0.05), accelerator_conf=AcceleratorConfiguration(layout={GPU(name=\"NVIDIA A100-SXM4-40GB\", accelerator_type=AcceleratorType.Discrete, vram=Memory(quantity=40.0, byte_suffix=ByteSuffix.GiB), max_power_limit=400.0, pci_id=\"0x20B010DE\", compute_sm=80): 8}), numa_conf=NUMAConfiguration(numa_nodes={}, num_numa_nodes=4), system_id=\"DGX-A100_A100-SXM4-40GBx8\")", | ||
"early_stopping_met": true, | ||
"effective_min_duration_ms": 600000, | ||
"effective_min_query_count": 100, | ||
"result_completed_samples_per_sec": 25405.6, | ||
"result_validity": "VALID", | ||
"satisfies_query_constraint": false, | ||
"scenario": "Server", | ||
"scenario_key": "result_completed_samples_per_sec", | ||
"summary_string": "result_completed_samples_per_sec: 25405.6, Result is VALID, 10-min runtime requirement met: True", | ||
"system_name": "DGX-A100_A100-SXM4-40GBx8_TRT", | ||
"tensorrt_version": "10.2.0", | ||
"test_mode": "PerformanceOnly" | ||
} |
Oops, something went wrong.