peppinob-ol/attribution-graph-probing
Automates attribution-graph analysis via probe prompting: circuit-traces a prompt, auto-generates concept probes, profiles feature activations across those probes, and clusters features into supernodes.
GitHub repository with 5 stars and 0 forks.
Language: Jupyter Notebook
Topics: attribution-graphs, circuit-tracing, cross-layer-transcoder, feature-activation, graph-analysis, llm-interpretability, mechanistic-interpretability, neuronpedia, probe-prompting, prompt-probing
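
The last step in the description, clustering features into supernodes, can be illustrated with a toy sketch. This is not the repository's algorithm. It assumes each feature already has an activation profile (probe concept mapped to activation strength) and groups features by the concept on which they peak. The function name `group_into_supernodes`, the feature IDs, and the `min_activation` threshold are all hypothetical.

```python
from collections import defaultdict


def group_into_supernodes(profiles, min_activation=0.0):
    """Group features by the probe concept on which each one peaks.

    `profiles` maps a feature ID to a {concept: activation} dict.
    Hypothetical helper for illustration only; the repository's own
    clustering logic may differ substantially.
    """
    supernodes = defaultdict(list)
    for feature_id, activations in profiles.items():
        if not activations:
            continue  # no probe data for this feature
        # Pick the concept with the highest activation for this feature.
        concept, peak = max(activations.items(), key=lambda kv: kv[1])
        # Drop features that never activate above the threshold.
        if peak > min_activation:
            supernodes[concept].append(feature_id)
    return {concept: sorted(ids) for concept, ids in supernodes.items()}


# Made-up activation profiles for four features over two probe concepts.
profiles = {
    "L3/F12": {"capital": 0.9, "Texas": 0.1},
    "L5/F40": {"capital": 0.7, "Texas": 0.2},
    "L7/F03": {"capital": 0.0, "Texas": 0.8},
    "L9/F77": {"capital": 0.0, "Texas": 0.0},
}
supernodes = group_into_supernodes(profiles)
print(supernodes)
```

Grouping by peak concept is the simplest possible choice. A real pipeline would more likely compare whole activation profiles, for example with a similarity threshold, and might also take layer position into account.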