I took your 456 monthly values into AUTOBOX which examines series by ITERATIVELY considering anomalies , power transformations , transience in EITHER model parameters or model error variance over time .... https://autobox.com/pdfs/ARIMA%20FLOW%20CHART.pdf
It developed a model which broadly speaking which does not use a regular differencing as yours did BUT it did introduce the need for a logarithmic transformation.
Following is the identified model
and here
... (2,0,0)(1,1,0)12 in logs
The ambiguity that you encountered was to apply regular differencing or not . The acf of the seasonally differenced data is here
suggesting an ar(2) for short term memory NOT regular differencing . The message here is that the conclusion not to regularly difference and ar/ma structure should be made AFTER the seasonal differencing is in place.
The two pulses are of minor importance but might be of interest as they reflect values that were identified as extraordinary. AUTOBOX detected an ar(12) factor while you used an ma(12) .
The recent actuals and forecasts are here
with the residual acf here suggesting model sufficiency 