The YouTube Faces Dataset was originally intended for face recognition across videos: given two videos, do they show the same person or not? This will be our initial goal. We will be using the YouTube Faces with Facial Keypoints dataset (https://www.kaggle.com/selfishgene/youtube-faces-with-facial-keypoints), which its author describes as follows: "This dataset is a processed version of the YouTube Faces Dataset, that basically contained short videos of celebrities that are publicly available and were downloaded from YouTube. There are multiple videos of each celebrity (up to 6 videos per celebrity). I've cropped the original videos around the faces, plus kept only consecutive frames of up to 240 frames for each original video. This is done also for reasons of disk space, but mainly to make the dataset easier to use."

We may go further than identifying whether two videos contain the same person: we may also try to use the dataset to build a face movement model that predicts which facial expression will come next.
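
To make the initial goal concrete, here is a minimal sketch of the same-person decision for two videos. It assumes each frame has already been reduced to a numeric feature vector (for example, flattened facial keypoints or an embedding); the helper names, the toy vectors, and the 0.8 threshold are all hypothetical and are not part of the dataset or of any particular library. Averaging frames and comparing with cosine similarity is only one simple baseline, not necessarily the method we will end up using.

```python
import math

def mean_vector(frames):
    """Average per-frame feature vectors into one video-level descriptor."""
    n = len(frames)
    return [sum(column) / n for column in zip(*frames)]

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def same_person(video_a, video_b, threshold=0.8):
    """Guess whether two videos show the same person.

    The threshold is an assumed value for illustration; in practice it
    would be tuned on labelled pairs of videos.
    """
    similarity = cosine_similarity(mean_vector(video_a), mean_vector(video_b))
    return similarity >= threshold

# Toy per-frame feature vectors (made-up numbers, two frames per video).
video_1 = [[0.9, 0.1, 0.2], [1.0, 0.0, 0.3]]
video_2 = [[0.8, 0.2, 0.2], [0.9, 0.1, 0.1]]
video_3 = [[0.1, 0.9, 0.7], [0.0, 1.0, 0.8]]

print(same_person(video_1, video_2))
print(same_person(video_1, video_3))
```

In this toy setup the first two videos have nearly parallel descriptors and the third points in a different direction, so only the first pair passes the threshold. Real per-frame features would come from the dataset's images or keypoints rather than hand-written lists.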