I am trying to find a labeled dataset for users profiles pictures with their personality traits scores. Unfortunately, I did not find any and therefore, I decided to crawl twitter for public users profile pictures with their tweets. At that moment, I already had very good personality models that been trained on golden data of 100k users at facebook reported their personality after they answered 100-item personality-related questions. The final models are accurate and published in A* IEEE proceedings.
At this moment, I used the crawled twitter users tweets to predict their personality at the models we already have, and this gives me the ability to label their profile picture with the personality scores we predicted.
Later on, I extracted from the crawled Twitter users profile picture 50 facial features where I am correlating them using Pearson with the predicted personality scores in order to use the significantly correlated features to build another personality prediction models solely by analyzing the user profile picture.
So far, I have been with a conversation with many experienced researchers and some agree with what I am doing and some disagree, where the agreed researchers say that as long as I am finding correlated facial features with personality scores it should be scientifically correct. So I am kindly asking whether the methodology that I am following is scientifically correct or not? as many already tried to avoid answering my question.