I think you talk about the software design you must try to seek in pattern recognition to extract the main features in the voice the store it in memory, i think you also need to take large no of samples from each voice with different words to get the right features
i guess your way to finish this project is to develop it software
by hardware could be impossible
i am aiming to do the same project using matlab
we could share ideas..