here your sel is 10, so its its 12 bits valid operation, so you need to pass the inputs in the consecutive three cycles.
so in the first clock cycle say if you are passing the data_in as x"F", the below thing will execute as
data_out_sig <= x"0" & data_out_sig(7 downto 0) & data_in;
here after the 1st clock (means the beginning of the 2nd clock) the data_out_sig bwcomes x"000F"
in the 2nd clock cycle say if you are passing the data_in as x"3", the below thing will execute as
data_out_sig <= x"0" & data_out_sig(7 downto 0) & data_in;
here after the 2nd clock (means the beginning of the 3rd clock) the data_out_sig bwcomes x"00F3"
in the 3rd clock cycle say if you are passing the data_in as x"7", the below thing will execute
data_out_sig <= x"0" & data_out_sig(7 downto 0) & data_in;
here after the 3rd clock (means the beginning of the 4th clock) the data_out_sig bwcomes x"0F37"
I think its better to add a reset port and an enable port to your design