When chatbots are faced with human interaction containing similes and idioms, their performance falls to between 10 to 20%.