Some points are:
1) Without buffering, the slopes of the signal at great delays will be so slow that the time at which it crosses the logical level become too sensitive. This can bring incertitudes (variations) in the delay times and perhaps glitches.
2) With your last topology, buffers should be used as well. The problem with this topology is that delays could be non-monotonic. For example, delay at tap nominally 19 ps could be greater than delay at tap nominally 20 ps, even if buffers are used. Depending of the application, this can be a problem or not.
3) Debdut, what are the tests/simulations you perform in order to check that your circuits work?