Want to objectively measure the quality and effectiveness of your LLM-based applications? In this post I discuss Trulens, the perfect tool for the job.

Engineering Leadership with a side of Quality Evangelism
Want to objectively measure the quality and effectiveness of your LLM-based applications? In this post I discuss Trulens, the perfect tool for the job.
AI is here to stay, and if you work in Quality, you might want to learn how to test it. In this post I discuss how to leverage Playwright to test an LLM.
First in a series of posts about testing AI / Large Language models. In this first post learn how to run Llama2 locally so that you can begin your testing.