Where AI breaks

In this section, we describe how we evaluate the tools we build with AI at B12. Drawing on our limited evaluation of models we've shipped to production, we give examples of real bias, accuracy, and grammar issues we've found in those tools.