bn/asm/bn-c64xplus.asm: update commentary.

Reviewed-by: Rich Salz <rsalz@openssl.org>
Andy Polyakov 2015-12-21 14:26:12 +01:00
parent cfe670732b
commit b859d70d4a

@@ -284,8 +284,9 @@ _bn_mul_comba4:
 	.if	0
 	BNOP	sploopNxM?,3
 	;; Above mentioned m*2*(n+1)+10 does not apply in n=m=4 case,
-	;; because of read-after-write penalties, it's rather
-	;; n*2*(n+3)+10, or 66 cycles [plus various overheads]...
+	;; because of low-counter effect, when prologue phase finishes
+	;; before SPKERNEL instruction is reached. As result it's 25%
+	;; slower than expected...
 	MVK	4,B0		; N, RILC
 ||	MVK	4,A0		; M, outer loop counter
 ||	MV	ARG1,A5		; copy ap