For some good statistical benchmark models to compare data-driven approaches against, have a look at this paper:
At what level are you doing the forecasts (aggregated or individual customers)? Day-ahead or intraday? What's the data resolution? Are you looking at point forecasts or probabilistic forecasts?

We recently reviewed a bunch of papers focussed on the distribution grid and summarized them in this preprint:
An updated version is currently under review. We haven't uploaded it to arXiv yet, but I can PM it to you.