aws-samples/sample-gen-ai-evaluations-workshop
This workshop teaches systematic approaches to evaluating Generative AI workloads for production use. You'll learn to build evaluation frameworks that go beyond basic metrics to ensure reliable model performance while optimizing cost and performance.
GitHub repository with 46 stars and 19 forks.
Language: Jupyter Notebook