My knowlegde of statistics is on a basic level, and I'm no native speaker, and my kids are running around so my concentration isn't up to much. I don't understand the "to different sources of varaition" bit.
My situation is basically
Or, are you faced with a situation of multiple inputs that are changing and affecting multiple outputs?
School assignment? You must have very advanced schools, I didn't do something like this in my studies (to be fair, I didn't specialize in statistics).
It's something I like to try for work. The black box is a real-time simulation tool (so every step takes real time + x, i.E. is "expensive"). The input parameters are 1.) constant parameters which I cannot change (vehicle data) and 2.) parameters I change during a heuristic optimization. Output: data about how the vehicle was driving and different vehicle dynamics signals.
Now, this optimization is done for different kind of vehicles, so the input 1.) is different for every vehicle, but constant during the optimization. The parameters 2.) are changed in order to reach some dynamic values.
As every step is expensive, my idea was that if a new vehicle starts the opti, I could fetch the parameters 1.) and some target output values and compute some starting values for the parameters 2.). Therefore I need some model to train with present data (could be a neural network, but I guess there is much more in the machine learning toolbox). But beforehand I want to make sure that the model is not over-determinded. This applies many to parameters 1.), so I want to know what vehicle parameters are important for the opti and which are not. I already collect data during the opti runs and I was hoping to find some method that shows some sort of correlation value between each input value (or pair, tuples...) and some output value.
Is it any clearer now? Sorry that I'm so bad at explaining...
thanks a lot,
Heiko