Any time you specify voltage, high speed and logic, one also needs to specify impedance of source, load , capacitance and current.
The fanout requirements for 1 gate load are trivial compared to level translator for a clock driver with 20 loads.
The slew rate of your Op Amp is probably the limiting factor in your design.
CMOS is a much better approach to the design because the speed, source impedance and current drive is more efficient, faster and more symmetrical.
ARM uses 25 Ohm source drivers, others range up 50 to 75 Ohms ballpark and are supply voltage dependant. The impedance is derived from specs for Vol,Vol at rated current.
(Vcc-Voh)/Ioh and Vol/Iol for ESR.
But more important in logic translators is the concept of threshold and noise margin. For HC logic it is always Vcc/2 but for backward compatibility with TLL it is 1.3V threshold, so the valid logic is 0.8 for max Vol and and 2.0V Min Voh, thus it doesn't matter if it is 1.5 or 3V if it allows or is protected from over voltage.
That was just for generalization in understanding the rules for a logic translator.
Possible solution
============
CMOS logic thresholds vary with each family but designed for best noise immunity and compatibility with other logic families.
Without knowing your receiver, loading and noise requirements, a solution cannot be stated simply but you can specify this or learn the more generalized design approach.
**broken link removed** which types are not tolerant to overvoltage, which which types are tolerant on logic level translators. (Also includes