As debate rages over the abilities of modern AI systems, scientists are still struggling to effectively assess machine intelligence.