AI-in-Transportation-Lab/awesome-mechanistic-interpretability
A curated collection of libraries, projects, tutorials, research papers, and other resources on Mechanistic Interpretability, a growing subfield of machine learning interpretability research that aims to reverse-engineer neural networks into human-understandable computational components.
GitHub repository with 110 stars and 8 forks.
Language: JavaScript
Topics: llms, mechanistic-interpretability