### CPU-INFO ###
System: x86_64
Auto detected arch: amdfam10
Vendor-ID: AuthenticAMD
CPU-Family: 16
CPU-Model: 6
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowe xt 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 pop cnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt npt lbrv svm_lock nrip_save
gcc version 4.5.2 (Ubuntu/Linaro 4.5.2-8ubuntu4)
Using compilers "native" flags
### FFdeCSA TEST ###
Using compiler: g++
Flags: -march=native -fexpensive-optimizations -fomit-frame-pointer -funroll-loops
Testing optimization levels 2 and 3
Level -O2:
PARALLEL_32_INT
- 162, 280, 281, 124, 120, 120, 286, 116, 280, 281
- 286 Mbit/s max.
PARALLEL_64_2INT
- 110, 259, 84, 109, 260, 259, 259, 114, 265, 265
- 265 Mbit/s max.
PARALLEL_64_LONG
- 166, 402, 398, 157, 408, 408, 408, 130, 207, 411
- 411 Mbit/s max.
PARALLEL_64_MMX
- 185, 138, 101, 155, 369, 184, 149, 236, 380, 250
- 380 Mbit/s max.
PARALLEL_128_2LONG
- 165, 96, 182, 350, 291, 190, 209, 307, 138, 128
- 350 Mbit/s max.
PARALLEL_128_2MMX
- 163, 339, 339, 179, 345, 345, 346, 152, 338, 338
- 346 Mbit/s max.
PARALLEL_128_SSE
- 189, 438, 438, 438, 236, 428, 117, 207, 133, 219
- 438 Mbit/s max.
PARALLEL_128_SSE2
- 219, 500, 200, 488, 183, 499, 182, 134, 499, 166
- 500 Mbit/s max.
Fastest PARALLEL_MODE = PARALLEL_128_SSE2 (500 Mbit/s)
Level -O3:
PARALLEL_32_INT
...failed!
PARALLEL_64_2INT
...failed!
PARALLEL_64_LONG
- 118, 367, 366, 152, 128, 367, 367, 366, 135, 375
- 375 Mbit/s max.
PARALLEL_64_MMX
- 138, 289, 344, 345, 344, 344, 344, 124, 353, 353
- 353 Mbit/s max.
PARALLEL_128_2LONG
- 141, 360, 361, 361, 361, 360, 98, 303, 368, 133
- 368 Mbit/s max.
PARALLEL_128_2MMX
- 122, 298, 96, 168, 352, 204, 210, 313, 351, 352
- 352 Mbit/s max.
PARALLEL_128_SSE
- 200, 453, 202, 444, 445, 444, 444, 444, 442, 443
- 453 Mbit/s max.
PARALLEL_128_SSE2
- 137, 510, 169, 228, 301, 310, 514, 330, 160, 225
- 514 Mbit/s max.
Fastest PARALLEL_MODE = PARALLEL_128_SSE2 (514 Mbit/s)
Best result with -O3 and PARALLEL_128_SSE2 at 514 Mbit/s
### VDR-SC FFdeCSA Makefile OPTS ###
CPUOPT ?= native
PARALLEL ?= PARALLEL_128_SSE2
CSAFLAGS ?= -O3 -fexpensive-optimizations -fomit-frame-pointer -funroll-loops
### GENERIC FFdeCSA make OPTS ###
FLAGS="-O3 -march=native -fexpensive-optimizations -fomit-frame-pointer -funroll -loops" PARALLEL_MODE=PARALLEL_128_SSE2