Since R3 is too large to drive the base and the diodes, it must be reduced so that it supplies enough current at worst-case hFE, which may be 30% of nominal at low Vce (2 V), the minimum you want across the output transistor for max Pout.
So if Vmax = 68 V into 8 ohms at beta = 50, the driver must handle a 400 ohm load, then the previous stage a 2K load, and so on. Typically no more than 3 emitter followers are used, since each Vbe drop is costly, so a 4th driver uses complementary open collectors to the rails, with a current source for the crossover base drive.
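A rough sketch of the arithmetic behind the 400 ohm figure, using the common approximation that an emitter follower's input looks like beta times its load. The function names are made up for illustration, and the calculation ignores Re, Vbe and the transistor's own emitter resistance.

```python
# Sketch only: load reflected through one emitter follower, approximated
# as beta * load. Function names are hypothetical, not from any library.

def reflected_load(load_ohms, beta):
    """Approximate resistance seen looking into the follower's base."""
    return load_ohms * beta


def peak_base_current(v_peak, load_ohms, beta):
    """Peak base current needed to swing v_peak across load_ohms."""
    return v_peak / load_ohms / beta


V_PEAK = 68.0  # volts, from the text
R_LOAD = 8.0   # ohms, from the text
BETA = 50      # worst-case beta, from the text

driver_load = reflected_load(R_LOAD, BETA)
i_peak = V_PEAK / R_LOAD
i_base = peak_base_current(V_PEAK, R_LOAD, BETA)

print(f"driver sees about {driver_load:.0f} ohm")
print(f"peak load current {i_peak:.2f} A, peak base current {i_base * 1000:.0f} mA")
```

With these numbers the output transistor needs about 170 mA of peak base current to push 8.5 A into the speaker, which is why a large base resistor cannot drive it directly.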
In your simple design, there is not enough base drive and too much loss in the base resistance.
When the input is negative, the worst-case bias through the resistor and diodes will result in excess power dissipated in the pull-up, so several cascaded stages are needed, assuming a worst-case beta of 50 against a nominal value of roughly 150.
Then your output Re can be reduced to 0.1 or 0.05 ohms, and overall negative feedback lowers the output impedance by roughly the amount of the gain. That gives better bass response, a high "damping factor" (the ratio of load impedance to output impedance) and lower THD.
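To put numbers on the damping factor, here is a minimal sketch. It assumes the open-loop output impedance is just Re (ignoring the transistor and driver contributions) and uses the textbook result that feedback divides output impedance by (1 + loop gain). The loop gain of 100 is a made-up illustrative value, not something from the design.

```python
# Sketch only: damping factor with and without overall negative feedback.
# LOOP_GAIN is a hypothetical value chosen for illustration.

def closed_loop_zout(open_loop_zout, loop_gain):
    """Output impedance after feedback: Zout / (1 + loop gain)."""
    return open_loop_zout / (1.0 + loop_gain)


def damping_factor(load_ohms, zout_ohms):
    """Ratio of load impedance to amplifier output impedance."""
    return load_ohms / zout_ohms


R_LOAD = 8.0       # ohms, speaker load from the text
RE = 0.1           # ohms, emitter resistor value from the text
LOOP_GAIN = 100.0  # assumption, not from the text

df_open = damping_factor(R_LOAD, RE)
z_closed = closed_loop_zout(RE, LOOP_GAIN)
df_closed = damping_factor(R_LOAD, z_closed)

print(f"damping factor without feedback: {df_open:.0f}")
print(f"output impedance with feedback: {z_closed * 1000:.2f} milliohm")
print(f"damping factor with feedback: {df_closed:.0f}")
```

In practice speaker cable and connector resistance add to the output impedance, so the real-world figure at the speaker terminals will be lower than the calculated one.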
Note that in audioguru's excellent design there are only two emitter follower stages in the output, with complementary open collectors driving Q4, Q7 and Q8 on the high side, and the same on the low side. With all these stages AND negative feedback, parasitic oscKILLation can fry the outputs, so a capacitor in series with a resistor of the same value as the load works to suppress it above 20 kHz. Running without this network and with no speaker connected will ruin the amp, but with this protection it will survive having no load.
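As a rough way to size that series RC network (usually called a Zobel network), the sketch below picks the capacitor so its reactance equals the resistor at a chosen corner frequency. Using 20 kHz as the corner is my assumption based on the "above 20 kHz" remark. Real designs often place the corner higher, so treat the result as a starting point and check it against the actual schematic.

```python
import math

# Sketch only: capacitor for a series RC (Zobel) network whose corner
# frequency is f = 1 / (2 * pi * R * C). The 20 kHz corner is an assumption.

def zobel_capacitor(r_ohms, corner_hz):
    """Capacitance (farads) giving the requested RC corner frequency."""
    return 1.0 / (2.0 * math.pi * r_ohms * corner_hz)


R_ZOBEL = 8.0        # ohms, same value as the load per the text
F_CORNER = 20_000.0  # hertz, assumed corner frequency

c_zobel = zobel_capacitor(R_ZOBEL, F_CORNER)
print(f"C is about {c_zobel * 1e6:.2f} uF")
```

Above the corner the network presents roughly 8 ohms to the output stage even with the speaker disconnected, which is what keeps the feedback loop loaded at high frequencies.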