To control several dvices via the parallel port don't use multplexers and/or buffers, as they work "momentary" .. you need latches ..
One option is to employ 8255 PPI ..
Here is an example:
"24-bit parallel port interface with 8255" https://jap.hu/electronic/c64pio.html
Google for more ..
You can use the parallel port's 8 data bits plus one of its control bits to give you 9 bits. General info here: **broken link removed**
If you don't need high speed, another approach is to use a shift register such as 74HCT594. It provides eight outputs, and includes an output register. You can easily chain multiple devices, all controlled by a few wires. Here's a data sheet: http://www.standardics.nxp.com/products/hc/pdf/74hc594.74hct594.pdf