For pure raw speed write it in FORTH. That can be even faster than assembly. Of course you have to learn to think backwards first ...