As conventional AI benchmarking techniques prove inadequate, AI builders are turning to more creative ways to assess the ...