bn/asm/bn-c64xplus.asm: update commentary.
Reviewed-by: Rich Salz <rsalz@openssl.org>
This commit is contained in:
parent
cfe670732b
commit
b859d70d4a
@ -284,8 +284,9 @@ _bn_mul_comba4:
|
|||||||
.if 0
|
.if 0
|
||||||
BNOP sploopNxM?,3
|
BNOP sploopNxM?,3
|
||||||
;; Above mentioned m*2*(n+1)+10 does not apply in n=m=4 case,
|
;; Above mentioned m*2*(n+1)+10 does not apply in n=m=4 case,
|
||||||
;; because of read-after-write penalties, it's rather
|
;; because of low-counter effect, when prologue phase finishes
|
||||||
;; n*2*(n+3)+10, or 66 cycles [plus various overheads]...
|
;; before SPKERNEL instruction is reached. As result it's 25%
|
||||||
|
;; slower than expected...
|
||||||
MVK 4,B0 ; N, RILC
|
MVK 4,B0 ; N, RILC
|
||||||
|| MVK 4,A0 ; M, outer loop counter
|
|| MVK 4,A0 ; M, outer loop counter
|
||||||
|| MV ARG1,A5 ; copy ap
|
|| MV ARG1,A5 ; copy ap
|
||||||
|
Loading…
Reference in New Issue
Block a user