koosdoos,
Maybe you can try to put all the features you have directly, and see what's the result.
After that, convert them into binary and see how's the result. For example, the centre point coordinate is (125,200), you convert them to binary, lets 8bits, so the coordinate already 16bits. Then plus the other features you have. So, guess that you have more than 100bits as inputs. I'm not sure this way can work or not, you can have a try.
You should take few images from same finger, so I guess you have around 20sets features from each finger. If you want to recognize 5 fingerprint, then you have 100 sets of features. You target output can be indicating the finger. Lets say 001 is first finger, 010 is second, 011 is third and so on. Then train your network using all these data until the network is converged.
It's advisable you test your network using different kinds of inputs and outputs then determine which way is the best for you.