T
15

That AI consultant said my dataset was too clean, turns out he was right

Guy from a startup meetup told me I was wasting time scrubbing every null value out of my training data. Ran a model on purpose after he left and it actually performed better with some missing fields included. Has anyone else found that messy real-world data works better for certain use cases?
2 comments

Log in to join the discussion

Log In
2 Comments
black.patricia
Too clean" datasets, huh. I get the theory but most real world data is a mess and training on scrubbed stuff can miss that. Saying it performs better with missing fields feels like an excuse to skip proper handling, not a genuine improvement.
5
dakotam17
dakotam1715d ago
Wait, you're saying people think a model actually performs better with missing data? That sounds backwards to me, like claiming a car runs smoother with a flat tire.
4