There's a million choices here so duplicate the best layout recommendation with the Bill of Material (BOM) component size, e.g. 603 and placement.
Short answer , r
ead more and copy the biggest or best solution in datasheets or uC kits or uC forums and ignore some DIY advice webpages unless the person understand what SRF and Q means and has measured it.
Here I expect <10 mohm ESR for ceramic and low ESR e-caps ESR*C < 10 us so 4 uF might be 2 ohms. Yet a standard e-cap will be 100 times bigger and useless. Sometimes too many low ESR caps causes resonance issues.
long answer (written hastily with out error checking)
A VLSI uC is a noise generator on Vdd with a few capacitors on each transistor that discharges on any clock transition. You supply noise depends depends on the output conductance vs f and load noise being attenuated by a smooth RLC filter.
Everything including wires, traces, capacitors, resistors, inductors and ICs each has an equivalent RLC value. yet schematics show ideal logical traces to mean ideal R=L=C=0. but this is not reality. We make it as small as possible to neglect. Even a short skinny trace of 1mm can be 1 nH which is ~ 1.5 Ohms is 10 times bigger and cannot attenuate a supply ripply from a 1% load error supply at 5V/1A = 50 mOhm supply generating 100 mV of SMPS ripple.
The uC IC is a million little FETs with capacitance that =charge and dump every transition. This can accumulate to a significant noise level, so we must choose caps that are 100 times bigger and much lower ESR> My paper napkin estimate of an STM32 is more than 1000 pF (1nF) of noise capacitance or more, which is also in the 1 ohm range at 150 MHz. So my using low ESR caps in that range the IC noice can be attenuated. Then for lower frequency noise, we add x uF e-Caps.
Keep in mind the bandwidth is related to the rise time of uC pulses, not just the repetition rate (CLK) f=0.35/ Tr So the noise injected by a uC can spread over 150 MHz with a 2ns rise time.
This means we use an equivalent of Ohm's Law to attenuate noise at both ends which ends up being a complex low Q LPF filter design.
We consider the noise from both the source and the load by choosing caps over a wide spectrum so that the impedance at ESR*C is about 1% of the load noise source at the HF & VHF in the bands.
If all parts had ESR*C= T constant, then we would not have to consider parallel smaller parts. We would just assume like in e-caps, bigger C makes a smaller ESR, but in fact e-caps have alot of inductance in many layers inside the cap. So this bulk does not extend up into the VHF band that can be done by ceramic and plastic film.
Yet on schematics, we show C as just C but datasheets for an e-cap will have C, ESR and SRF where self-resonant frequency is due to path inductance (~0.7nH/mm typ. +/50% depending on length/width ratio) so e-caps have more effective series inductance(ESL) or simply L.
Consider schematics as logical wires but PCB traces as 7 nH/ cm and <1 to 4 pF/cm depending if there is a ground plane under the trace. Experts will estimate these and put them in their simulations with wide tolerances.
The rest of us just understand there is a good reason to use many caps to lower impedfance over a broad spectrum.
Wrap-up
In the 70's we didn't have simulation tools, so we just used 10 uF tantalum then ceramic 0.1 uF 1 nF and 100 pF NP0 in parallel ( unless it was simple analog) If we wanted to span from LF to UHF we chose from this standard then modified these guestimates as our understanding improved on impedance ratios and attenuation of noise sources and quality of passive components improved. But now with Digital VLSI the noise sources are much higher frequency so the deneral guidelines have changed. In TTL 1 cap per 10 IC's was fine. But for CMOS logic, every IC must have a cap, even if its only 1 nF for a simple IC. Because CMOS uses Enh-mode FETs and the lower the RdsON, the greater the Ciss Coss capacitance for input and output.
Low ESR ceramics are now replacing electrolytics and low ESR e-caps are also improving so they the choices are getting simpler.
This 4D nomograph is like a slide rule. You can solve for any RLCf variable with the intersection and compute Q and attenuation or gain.