trx-bench results for AMD G-T40E

This is merely a historical archive of years 2008-2021, before the migration to mailman3.

A maintained and still updated list archive can be found at https://lists.osmocom.org/hyperkitty/list/OpenBSC@lists.osmocom.org/.

Harald Welte laforge at gnumonks.org
Mon Jan 18 19:13:00 UTC 2016


Hi Thomas,

in case you'd like to add it to your benchmark results, below is a run
of trx-bench on a Debian stable (jessie) system on a G-T40E CPU @ 800MHz.

What I find surprising is that it appears to be comparable than the Atom
D2500 @ 1.86GHz, despite the Atom running at twice the clock rate...

More details see attachments.

Regards,
	Harald
-- 
- Harald Welte <laforge at gnumonks.org>           http://laforge.gnumonks.org/
============================================================================
"Privacy in residential applications is a desirable marketing option."
                                                  (ETSI EN 300 175-7 Ch. A6)
-------------- next part --------------
processor	: 0
vendor_id	: AuthenticAMD
cpu family	: 20
model		: 2
model name	: AMD G-T40E Processor
stepping	: 0
microcode	: 0x5000101
cpu MHz		: 800.000
cache size	: 512 KB
physical id	: 0
siblings	: 2
core id		: 0
cpu cores	: 2
apicid		: 0
initial apicid	: 0
fpu		: yes
fpu_exception	: yes
cpuid level	: 6
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni monitor ssse3 cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch ibs skinit wdt arat hw_pstate npt lbrv svm_lock nrip_save pausefilter vmmcall
bogomips	: 2000.01
TLB size	: 1024 4K pages
clflush size	: 64
cache_alignment	: 64
address sizes	: 36 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

processor	: 1
vendor_id	: AuthenticAMD
cpu family	: 20
model		: 2
model name	: AMD G-T40E Processor
stepping	: 0
microcode	: 0x5000101
cpu MHz		: 800.000
cache size	: 512 KB
physical id	: 0
siblings	: 2
core id		: 1
cpu cores	: 2
apicid		: 1
initial apicid	: 1
fpu		: yes
fpu_exception	: yes
cpuid level	: 6
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni monitor ssse3 cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch ibs skinit wdt arat hw_pstate npt lbrv svm_lock nrip_save pausefilter vmmcall
bogomips	: 2000.01
TLB size	: 1024 4K pages
clflush size	: 64
cache_alignment	: 64
address sizes	: 36 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

-------------- next part --------------

[+] Testing: GSM xCCH (non-recursive, flushed, not punctured)
[.] Input length  : ret = 224  exp = 224 -> OK
[.] Output length : ret = 456  exp = 456 -> OK
[.] Pre computed vector checks:
[..] Encoding: OK
[..] Decoding base: 
[..] Decoding SIMD: 
[..] Code N 2
[..] Code K 5
[..] Decoding: OK

[.] BER tests:
[..] Testing base:
[..] Input BER.......................... 0.023499
[..] Output FER......................... 0.004800
[..] Testing SIMD:
[..] Input BER.......................... 0.023390
[..] Output FER......................... 0.000000

[.] Performance benchmark:
[..] Testing base:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 3.427718 secs
[..] Rate............................... 1.330331 Mbps
[..] Testing SIMD:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 0.425603 secs
[..] Rate............................... 10.714210 Mbps
[..] Speedup............................ 8.053792

[+] Testing: GSM TCH/AFS 7.95 (recursive, flushed, punctured)
[.] Input length  : ret = 165  exp = 165 -> OK
[.] Output length : ret = 448  exp = 448 -> OK
[.] Pre computed vector checks:
[..] Encoding: OK
[..] Decoding base: 
[..] Decoding SIMD: 
[..] Code N 3
[..] Code K 7
[..] Decoding: OK

[.] BER tests:
[..] Testing base:
[..] Input BER.......................... 0.023450
[..] Output FER......................... 0.000300
[..] Testing SIMD:
[..] Input BER.......................... 0.023444
[..] Output FER......................... 0.000000

[.] Performance benchmark:
[..] Testing base:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 11.293366 secs
[..] Rate............................... 0.396693 Mbps
[..] Testing SIMD:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 0.686461 secs
[..] Rate............................... 6.526227 Mbps
[..] Speedup............................ 16.451577

-------------- next part --------------
--- Floating point to integer conversions
-- Testing 40000 iterations of 12480 values
- Measuring conversion time
- Elapsed time base...                 2.548129 secs
- Validating SSE conversion results... PASS
- Measuring conversion time
- Elapsed time SSE ...                 0.960012 secs
- Quotient...                          2.654268

-- Testing 40000 iterations of 3120 values
- Measuring conversion time
- Elapsed time base...                 0.640163 secs
- Validating SSE conversion results... PASS
- Measuring conversion time
- Elapsed time SSE ...                 0.191270 secs
- Quotient...                          3.346908

--- Skipping integer-float conversions (SSE4 only)

-------------- next part --------------
Convolving 156 * 4 samples for 10000 iterations
-----------------
Filter length 4
Base real-complex elapsed time... 0.372130 secs
SSE real-complex elapsed time... 0.241270 secs
Quotient 1.542380

Base complex-complex elapsed time... 0.521219 secs
SSE complex-complex elapsed time... 0.382408 secs
Quotient 1.362992

Filter length 8
Base real-complex elapsed time... 0.573190 secs
SSE real-complex elapsed time... 0.326938 secs
Quotient 1.753207

Base complex-complex elapsed time... 0.873947 secs
SSE complex-complex elapsed time... 0.602019 secs
Quotient 1.451693

Filter length 12
Base real-complex elapsed time... 0.776067 secs
SSE real-complex elapsed time... 0.396321 secs
Quotient 1.958178

Base complex-complex elapsed time... 1.226749 secs
SSE complex-complex elapsed time... 0.752869 secs
Quotient 1.629432

Filter length 16
Base real-complex elapsed time... 0.977423 secs
SSE real-complex elapsed time... 0.497255 secs
Quotient 1.965637

Base complex-complex elapsed time... 1.579161 secs
SSE complex-complex elapsed time... 0.922173 secs
Quotient 1.712435

Filter length 20
Base real-complex elapsed time... 1.179777 secs
SSE real-complex elapsed time... 0.585324 secs
Quotient 2.015596

Base complex-complex elapsed time... 1.931793 secs
SSE complex-complex elapsed time... 1.104140 secs
Quotient 1.749591

Filter length 24
Base complex-complex elapsed time... 2.284363 secs
SSE complex-complex elapsed time... 1.242570 secs
Quotient 1.838418

Filter length 32
Base complex-complex elapsed time... 3.125769 secs
SSE complex-complex elapsed time... 1.562846 secs
Quotient 2.000049

Filter length 64
Base complex-complex elapsed time... 5.945989 secs
SSE complex-complex elapsed time... 2.844491 secs
Quotient 2.090353

Filter length 128
Base complex-complex elapsed time... 11.598979 secs
SSE complex-complex elapsed time... 5.407983 secs
Quotient 2.144788



More information about the OpenBSC mailing list