Imagine you have two signal sources, both at exactly the same frequency, and you are displaying them together.
If the displayed waves go up and down together, they are in phase. The time difference is zero.
If one is delayed such that its displayed wave lags behind, reaching its peaks later, the waves are not in step.
The PHASE of the delayed signal is no longer zero with reference to the first signal.
If a signal is briefly slightly increased in frequency, then brought back, its little excursion will have involved the waveform departing from a strict sine-wave, and the peaks of the wave will have gained a lead on the reference signal. once settled back on frequency, you have a signal where the PHASE is some number of degrees ahead. In this context, a whole wavelength means 360 degrees, or 2*PI radians.
The sine-wave can be plotted by thinking of a rotating arrow vector (called a phasor). The length of the arrow is the amplitude, and the arrow is rotating about its base at the centre of a circle. The revolutions per second is the frequency. It turns through 2*PI radians each revolution, and the motion of the tip can plot out a regular sinewave. The angle it has turned through is its PHASE.
A sine-wave is plotted as y=A*sine(w*t+PHASE) where w (the angle) = 2*PI*frequency.
If a wave has PHASE =0, and another has PHASE=(some angle) , then the PHASE of the second will be ahead of the first.
You can see now that wiggling about with the PHASE will give PHASE MODULATION.
You can also see that FM (frequency modulation) involves a continuous messing with the phase.
Hope this helps.