Arjun Bansal’s Post

View profile for Arjun Bansal, graphic

CEO & Co-founder at Log10

Evaluations of LLM applications is a big buzzword right now... but when do you really need new tools? Our latest blog post from Wenzhe Xue cuts through the noise to show with examples how built-in libraries such as pytest are often enough to get started. As complexity grows along the following dimensions, we're here to help! Metric based ➡ Human review Off the shelf eval or LLM as a judge ➡ Custom eval models trained on your data Offline ➡ Online, realtime https://lnkd.in/gGb3Sjah

Pytest is All You Need

Pytest is All You Need

arjunbansal.substack.com

To view or add a comment, sign in

Explore topics