Peter Attia· MD
but then in the 2017 paper they actually took a Next Step which was the Insight that where exactly the thing that we were focusing on was in a sentence what was before and after the actual ordering of it mattered not just the simple cooccurrence that knowing what position that word was in a sentence actually made the difference that paper showed the performance went way up in terms of recognition and that Transformer architecture came from that paper