#17 - Add RTX PRO 6000 benchmark YAMLs and baseline#162
Conversation
|
| Filename | Overview |
|---|---|
| AGENTS.md | Benchmark table extended with four new RTX PRO 6000 YAML filenames; change is correct and minimal. |
| docs/benchmarks/raw_benchmarking.md | New RTX PRO 6000 tip block added alongside the existing DGX Spark tip; covers all four new YAMLs with hardware-constraint notes and run commands. |
| docs/tutorials/configuration-walkthrough.md | Four new RTX PRO 6000 bullet entries added; one bullet references "Cliff's 800 Gbps cross-card target" — a personal-name reference in public-facing docs. |
| examples/CMakeLists.txt | Four new YAML filenames appended to DAQIRI_BENCH_CONFIGS list; straightforward addition, no logic changed. |
| examples/daqiri_bench_raw_sw_loopback_rtx_pro_6000.yaml | Software-loopback smoke test for RTX PRO 6000; uses map-style bench_rx/bench_tx matching existing loopback config conventions; buf_size 8064 matches payload+header; looks correct. |
| examples/daqiri_bench_raw_tx_rx_rtx_pro_6000.yaml | Generic placeholder template for dual-NIC cross-card 800 Gbps target; uses list-style bench_rx/bench_tx (parser supports both forms); eth_dst_addr and PCIe BDFs correctly left as angle-bracket placeholders; comment references "Cliff's 800 Gbps vision" — personal name in a public file. |
| examples/daqiri_bench_raw_tx_rx_rtx_pro_6000_nic.yaml | Prefilled dual-port NIC config for dev-box PCIe BDFs; eth_dst_addr correctly left as placeholder; hardware limitations documented; list-style bench_rx/bench_tx parsed correctly. |
| examples/daqiri_bench_raw_tx_rx_rtx_pro_6000_nic_same_port.yaml | Experimental same-PF TX+RX config; clearly documented as failing daqiri_init; kept for follow-up; no functional concerns. |
| examples/rtx_pro_6000_baseline.md | Baseline results doc; hardware constraints, measured numbers, follow-up items all clearly stated. |
Reviews (3): Last reviewed commit: "#17 - Doc sync and YAML fixes for RTX PR..." | Re-trigger Greptile
Partial #17: raw GPUDirect configs for RTX PRO 6000 Blackwell plus measured baseline on the dev box. No HDS, RoCE, FFT, or GEMM yet. Signed-off-by: Chloe Crozier <chloecrozier@gmail.com>
Adjust sw loopback IP placeholders; expand raw_benchmarking tip block. Signed-off-by: Chloe Crozier <chloecrozier@gmail.com>
6b83d46 to
694e527
Compare
Partially addresses #17
RTX PRO 6000 benchmark YAMLs and results from our development system.
What this PR adds
examples/rtx_pro_6000_baseline.mdwith measured results and hardware constraintsResults (no cable between ports)
Software loopback does not use the NIC. It confirms the GPUDirect build path but is not a wire-rate measurement. The NIC test shows transmit works at 382 Gbps; receive stayed at zero because the two ports are not connected to each other.
How to run
Hardware on the development system
61:00.0and61:00.1; both ports report link upTests completed without additional hardware
Next steps
61:00.0and61:00.1Outside #17, we will address these in the future:
HDS, RoCE closed-loop, FFT, GEMM, and Python bindings (#18).