Top AI models are getting really good at completing professional tasks, new OpenAI GDPval benchmark shows | Fortune