[Open-graphics] Multipliers in oga1hq
Nicholas S-A
nova at macintoshclub.com
Wed Aug 15 21:11:39 EDT 2007
> There's loop control overhead, too:
> ...
> Which is 32 * (7 or 8) + 2 = 226-258 cycles, I guess. Still around 200,
> as you said, but it's also still an order of magnitude slower than the
> same algorithm implemented in hardware (and I believe it would be
> comparatively cheap in hardware). As for whether or not that's horrible,
> well, it's a matter of opinion. :)
That was actually my original version, but I didn't want to look at the
docs to find the branch delay, and figured I could say to anybody who
brought this up that we would just unfold all the loops (which we would
for this). ;-)
Yes, I agree that hardware is much, much better, even with just an MSTEP
instruction (which could just do the two shifts and tested-add), but I
did want to bring up that it is not <horrible> if we are pressed for
space in the XP10 and need to do software multiply.
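
For anyone following along, here is a rough C sketch of the shift-and-add
multiply being discussed. It is only an illustration, not the actual
oga1hq code: the names mstep() and soft_mul32() are made up for this
example, and the real MSTEP instruction (if we ever add one) might split
the work differently. It returns only the low 32 bits of the product.

```c
#include <stdint.h>

/* Hypothetical model of one multiply step: a tested-add followed by the
 * two shifts. The name and operand layout are assumptions made for this
 * sketch, not a defined instruction. */
static void mstep(uint32_t *acc, uint32_t *multiplicand, uint32_t *multiplier)
{
    if (*multiplier & 1u)        /* tested-add: add only if the low bit is set */
        *acc += *multiplicand;
    *multiplicand <<= 1;         /* shift the multiplicand left by one */
    *multiplier >>= 1;           /* shift the multiplier right by one */
}

/* Software 32x32 multiply, keeping the low 32 bits of the result.
 * The loop runs one step per multiplier bit, 32 in total. */
uint32_t soft_mul32(uint32_t a, uint32_t b)
{
    uint32_t acc = 0;
    for (int i = 0; i < 32; i++)
        mstep(&acc, &a, &b);
    return acc;
}
```

The for loop here is the part that costs us the loop control overhead
mentioned above; writing out the 32 steps by hand is the unfolded
version. With a hardware step instruction, each mstep() call would
collapse into a single instruction.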
Cheers!
nick