If you are working on an Intel Pentium-class workstation, you can use the assembler instruction "rdtsc", which reads a hardware counter (the time-stamp counter) that runs at the CPU clock rate.
You can write an assembler routine which reads this counter and loops until it reaches a value corresponding to the delay you need.
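As a rough illustration of that loop, here is a minimal sketch in C using GCC/Clang-style inline assembly on x86. The names read_tsc, delay_cycles and delay_us are made up for this example, and cpu_hz is a value you have to supply yourself. The sketch assumes the counter ticks at a known, fixed rate and that the thread is not moved to another core during the wait.

```c
#include <stdint.h>

/* Read the 64-bit time-stamp counter with rdtsc (x86 only).
   rdtsc returns the low 32 bits in EAX and the high 32 bits in EDX. */
static inline uint64_t read_tsc(void)
{
    uint32_t lo, hi;
    __asm__ __volatile__("rdtsc" : "=a"(lo), "=d"(hi));
    return ((uint64_t)hi << 32) | lo;
}

/* Busy-wait until at least `cycles` counter ticks have elapsed. */
static void delay_cycles(uint64_t cycles)
{
    uint64_t start = read_tsc();
    while (read_tsc() - start < cycles) {
        /* spin */
    }
}

/* Convert a delay in microseconds to counter ticks.
   cpu_hz is an assumption: the counter frequency in Hz, supplied by the caller. */
static void delay_us(uint64_t us, uint64_t cpu_hz)
{
    delay_cycles(us * (cpu_hz / 1000000u));
}
```

For example, delay_us(50, 2000000000ULL) would spin for roughly 50 microseconds on a machine whose counter runs at 2 GHz. The loop only guarantees a minimum delay; if the process is preempted, the actual wait will be longer.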