Era-appropriate TRW MPY12HJ 12×12 parallel multiplier chip grabs the MUL instructions from the CPU, but requires code changes ...
Doesn't really matter if it is listening to customers when the chips are this good ...