Performance Data

Full Benchmark Results

All tests on NVIDIA Jetson AGX Orin 64 GB, JetPack 6.x, HammerIO nvCOMP GPU LZ4.

Summary Statistics

3.6s

7B model restore

7.9s

13B model restore

391 MB/s

Peak decompress

100%

SHA-256 PASS rate

Detailed Results

Model Size	Original	Compressed	Ratio	Compress	Decompress	Restore	Integrity
1 GB (7B)	1.0 GB	875.1 MB	1.17x	208 MB/s	283 MB/s	3.615s	PASS
3 GB (13B)	3.0 GB	2.4 GB	1.24x	237 MB/s	391 MB/s	7.861s	PASS
7 GB (70B)	7.0 GB	5.7 GB	1.23x	289 MB/s	378 MB/s	18.979s	PASS

Synthetic model weight data — float32 arrays with realistic distribution. Ratios reflect actual weight compressibility. Powered by HammerIO nvCOMP GPU LZ4.

Test Environment

Hardware: NVIDIA Jetson AGX Orin 64 GB
SDK: JetPack 6.x
Compression: HammerIO nvCOMP GPU LZ4 (files >500 MB), CPU zstd (files <500 MB)
Integrity: SHA-256 hash verification before and after every operation
Data: Synthetic float32 model weight arrays with realistic distribution

Methodology

Each benchmark run consists of a full store-compress-verify-restore cycle. Timing begins at the start of the compress or decompress operation and ends when the operation completes and integrity is verified. All times are wall-clock. Three runs per model size; median reported.

Compression Strategy

Files greater than 500 MB use nvCOMP GPU LZ4 via HammerIO for maximum throughput
Files less than 500 MB use CPU zstd for efficient handling without GPU overhead
SHA-256 hashes are computed before compression and verified after decompression
All operations are logged to the ModelSafe manifest for audit trail