ARIMA and similar models assume some sort of causal relationship between past values and past errors and future values of the time series: $$Y_{t+h}=f(Y_{t},Y_{t-1},Y_{t-2},....,\epsilon_{t},\epsilon_{t-1},\epsilon_{t-2},...)$$ e.g. the volatility of a stock today is causally driven by the volatility of that stock yesterday and two days ago, the population of a species this year is a direct function of the population of that same species last year, etc...
Facebook Prophet doesn't look for any such causal relationships between past and future. Instead, it simply tries to find the best curve to fit to the data, using a linear or logistic curve, and Fourier coefficients for the seasonal components. There is also a regression component, but that is for external regressors, not for the time series itself (The Prophet model is a special case of GAM - Generalized Additive Model).
Theoretically speaking, the assumptions underlying Prophet are indeed simplistic and weak - just fit the best curve to your historical data. Since fitting a curve to a limited data set over a specific time period doesn't impose any constraints on how the curve behaves outside of your historical data set, it is entirely possible that the best fitting curve will "go off the rails" outside of the historical time interval. For example, I have often noticed that Prophet can go negative in the future, even if the historical data set has only positive values, because the simplistic assumptions mean that it will naively perpetuate a downward trend forever.
This why prophet is recommended only for time series where the only informative signals are (relatively stable) trend and seasonality, and the residuals are just noise.
In theory, a more rigorous causal or structural approach is more likely to capture signals that will extrapolate into the future. More importantly, if the residuals are not just noise, then an ARIMA model or a Neural Network might be able to capture those relationships...in theory.
In practice, outside of the examples I mentioned above and a few others, the chances of finding a business time series where the underlying data generating process involves a causal relationship of the type $Y_{t+h}=f(Y_{t},Y_{t-1},Y_{t-2},...)$ are very slim. Think about it: why would sales for a grocery or fashion item ever be driven by a process of the form $Y_t = a_1Y_{t-1}+...a_nY_{t-n}+c+\sigma(t)$?
What causal mechanism would there be that says your sales of butter this week should be a linear combination of your butter sales last week and your butter sales from two weeks ago? Or that your web traffic today should be a linear combination of your web traffic from yesterday, two days ago, three days ago, and last week?
So at the end of the day, the assumptions of ARIMA and similar models end up being so strong and implausible that, for all of their mathematical rigor, they are just as add-hoc in practice as Prophet or Holt-Winters.
So the simplicity of Prophet's approach in practice makes sense for a lot of business time series. Moreover, the authors acknowledge this in their paper.