Some notes:
My frame numbers are 10 off from what is normally seen, including my first rip of an internet video. The subsequent rip straight from DVD had the frame number offset but I published data with those numbers after telling interested parties (party) that new frame numbering was in effect. The last rip for DVD had the right frame numbers but I renumbered to match the previous rip 'to avoid counfusion'. Yikes. I'm thinking of renumbering this set to correct frame numbers.
Initiation is around frame 907 in this latest series, 917 otherwise and possibly in the future.
The displacement graphs end around initiation. That's because progression is B-O-R-I-N-G from a data acquisition perspective. The stationary targets, for measuring camera wobble, run from 800-1020 to catch most of the period that's been examined to this point.
I was happy to put these graphs up because they represent the first fruits of many hours of labor in study, programming, and what have you. The process is much more sophisticated than days gone by. The data taken above uses adaptive thresholding (a must) and some blob classification, both new for me. The RGB channels are recorded individually for variable post-processing methodology, whereas they were simply averaged before.
It does just scratch the surface, though; by building my own 'SynthEyes' I get optimal control over every jot and tittle of the process. Supervised and unsupervised learning can be exploited to any degree. What I learn about the process can be rolled into code and repetitive 'intelligent' operations can be automated and parameterized.
Metadata, too. It can know when data is suspect or bad without even examining the data itself. This plot of a bad tracking has markers where size and color indicate 'quality'. Of course, the data is obviously bad but this sort of qualitative measure is useful when the distinction isn't clear ---

Frame viewing, cropping, filtering, feature extraction, tracking, and graphing of any sort all rolled into one program!