What should i do with this kind of data? (price from the ads and actual price)

Question

There are two subsamples in the dataset - on one the target is real(valid), and on the other it is approximate (I don't know how it differs yet, on one sample the real price of an apartment, and on the other the price from ads, you need to predict the real one, of course). Any ideas what to do about it? I have two ideas - to normalize the target from ads (to bring the expectation and variance to an real target), and also to modify the loss so that it punishes more for an error on an real target. There are no more ideas. Therefore, I ask for help.

upd: Sorry for not giving enough details. The the problem is to predict apartment price, which is made by professional realtors. The dataset is plenty features (like amount of shops in some radius, distant to closest school, etc.), and we have two subsets in this dataset: first is dataset with prices developed by realtors, and second is subset with prices from advertisings. The goal is to predict price the way realtors would do it, but of course realtor predictions are expensive, so we don't have enough data, and we use data from advertisings as well. So i'm asking how is better to treat subset with target values from advertisings.

Please clarify your specific problem or provide additional details to highlight exactly what you need. As it's currently written, it's hard to tell exactly what you're asking. — Community, Sep 23 '21 at 17:07
This is awfully sparse, making it hard to answer. What problem are you trying to solve? How does knowing what "do about it" (do about what? What are you doing?) help you solve that problem? — Sycorax, Sep 23 '21 at 17:07
Oh, i'm sorry. I've added this information to the main topic. Please, check it. — Robohant, Sep 23 '21 at 17:54

What should i do with this kind of data? (price from the ads and actual price)

0 Answers0