I have been looking through Gorodnichy's paper ("Video-based Framework for Face Recognition in Video") from CRV'05 and it's quite an eye-opener.
Gorodnichy examines three factors that explain why humans can recognise faces in low-resolution video without much trouble. The idea is biologically motivated, and what I see is a fresh paradigm for video-based face recognition (FR).
1. Detection: Very efficient mechanisms to detect a face prior to its recognition.
2. Decision: Accumulation of results over a period of time rather than one particular instance.
3. Learning: Efficient neuro-associative mechanisms.
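To make factor 2 concrete for myself, here is a minimal sketch of accumulating per-frame results over time. This is my own illustration, not the paper's algorithm: the function name `accumulate_decisions`, the per-frame score dictionaries, and the margin `threshold` are all assumptions I made up for the example.

```python
from collections import defaultdict


def accumulate_decisions(frame_scores, threshold=1.0):
    """Sum per-frame identity scores and commit to an identity only once
    the leader is ahead of the runner-up by at least `threshold`.

    frame_scores: list of dicts mapping identity -> score for one frame
                  (hypothetical recogniser output, not the paper's format).
    Returns (identity, frame_index) on a decision, or None if the
    evidence never becomes strong enough.
    """
    totals = defaultdict(float)
    for frame_index, scores in enumerate(frame_scores):
        # Add this frame's evidence to the running totals.
        for identity, score in scores.items():
            totals[identity] += score
        if not totals:
            continue  # no evidence seen yet
        ranked = sorted(totals.values(), reverse=True)
        runner_up = ranked[1] if len(ranked) > 1 else 0.0
        if ranked[0] - runner_up >= threshold:
            best_id = max(totals, key=totals.get)
            return best_id, frame_index
    return None


frames = [
    {"alice": 0.6, "bob": 0.5},
    {"alice": 0.4, "bob": 0.7},  # a single frame can favour the wrong person
    {"alice": 0.9, "bob": 0.1},
    {"alice": 0.9, "bob": 0.2},
]
print(accumulate_decisions(frames))
```

The second frame on its own would pick the wrong person, but the running totals ride over it, which is the appeal of deciding over a period of time rather than at one particular instance.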
Number 3 is probably interchangeable with other neural models, though I think Gorodnichy was brilliant in using just binarized information of various versions. The experimental evaluation didn't seem to cover a large enough population of subjects, or enough variation in pose and illumination.
Nevertheless, a good read to start with...