thomasthaddeus/DataAnalysisToolkit

DataAnalysisToolkit is a Python-based tool that streamlines common data analysis tasks: loading data from CSV files, performing statistical calculations, detecting outliers, cleaning data, and visualizing data.

GitHub repository with 7 stars and 2 forks.

Language: Jupyter Notebook

Topics: data-science, matplotlib, python, python-script, python3, scikit-learn

Latest metric snapshot

2026-06-05: 7 stars and 2 forks.

Similar repositories

  1. chrisvdweth/selene

    An open, large-scale, interactive textbook.

    GitHub repository with 163 stars and 16 forks.

    Trending score: 1.13; stars gained: +14; forks gained: +0.

    Language: Jupyter Notebook

    Topics: ai, computer-science, edcuational, learning-resources, data-science, deep-learning

  2. mito-ds/mito

    Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet

    GitHub repository with 2,637 stars and 207 forks.

    Trending score: 0.32; stars gained: +1; forks gained: +0.

    Language: Jupyter Notebook

    Topics: data-science, python, data, data-visualization, data-analysis, jupyter

  3. cfneves/turma-visualizacao-de-dados

    Collaborative repository for the Data Analysis and BI course · Python · Pandas · SQL · Power BI · Tableau · Streamlit · Lab365 / SENAI SC - Data Visualization and Business Intelligence

    GitHub repository with 17 stars and 25 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Jupyter Notebook

    Topics: data-analysis, data-visualization, jupyter, matplotlib, pandas, plotly

  4. dr-mushtaq/Machine-Learning

    A complete A-Z guide to Machine Learning and Data Science using Python. Includes implementation of ML algorithms, statistical methods, and feature selection techniques in Jupyter Notebooks. Follow Coursesteach for tutorials and updates.

    GitHub repository with 58 stars and 27 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Jupyter Notebook

    Topics: classification, machine-learning, unsupervised-learning, python, scikit-learn, data-science

  5. drshahizan/HPDP

    High performance data processing employs high performance computing (HPC) to process data, which is then translated into information and knowledge. The advent of high-performance computing and data analytics enabled real-time interrogation of extremely large data sets.

    GitHub repository with 154 stars and 139 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Jupyter Notebook

    Topics: aws-certification, big-data, data-processing, data-science, dataset, high-performance-computing

  6. aayushmanz/Python-For-Data-Science

    Python concepts from basics to advanced for Machine Learning learners

    GitHub repository with 6 stars and 0 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Jupyter Notebook

    Topics: beginner-friendly, data-science, jupyter-notebook, machine-learning, python-for-beginners, python-tutorial

Trending in Jupyter Notebook

  1. NVIDIA/cosmos

    NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

    GitHub repository with 9,194 stars and 588 forks.

    Trending score: 2.37; stars gained: +326; forks gained: +20.

    Language: Jupyter Notebook

  2. GoogleCloudPlatform/generative-ai

    Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform

    GitHub repository with 16,981 stars and 4,249 forks.

    Trending score: 1.87; stars gained: +8; forks gained: +6.

    Language: Jupyter Notebook

    Topics: generative-ai, llm, vertex-ai, langchain, gemini, gemini-api

  3. DataTalksClub/llm-zoomcamp

    LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.

    GitHub repository with 5,620 stars and 1,015 forks.

    Trending score: 1.87; stars gained: +93; forks gained: +13.

    Language: Jupyter Notebook

  4. Biohub/esm

    GitHub repository with 2,685 stars and 332 forks.

    Trending score: 1.82; stars gained: +48; forks gained: +12.

    Language: Jupyter Notebook

  5. nerdai/llm-agents-from-scratch

    Build LLM agents and multi-agent systems from scratch, with MCP, Skills, and A2A

    GitHub repository with 131 stars and 45 forks.

    Trending score: 1.70; stars gained: +32; forks gained: +14.

    Language: Jupyter Notebook

  6. openai/openai-cookbook

    Examples and guides for using the OpenAI API

    GitHub repository with 73,988 stars and 12,525 forks.

    Trending score: 1.57; stars gained: +24; forks gained: +12.

    Language: Jupyter Notebook

    Topics: openai, chatgpt, gpt-4, openai-api

Trending topic: data-science

  1. apache/superset

    Apache Superset is a Data Visualization and Data Exploration Platform

    GitHub repository with 73,180 stars and 17,522 forks.

    Trending score: 2.73; stars gained: +24; forks gained: +19.

    Language: TypeScript

    Topics: analytics, apache, apache-superset, asf, bi, business-analytics

  2. marimo-team/marimo

    A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

    GitHub repository with 21,312 stars and 1,126 forks.

    Trending score: 2.54; stars gained: +20; forks gained: +7.

    Language: Python

    Topics: notebooks, python, data-science, machine-learning, artificial-intelligence, data-visualization

  3. streamlit/streamlit

    Streamlit — A faster way to build and share data apps.

    GitHub repository with 44,830 stars and 4,264 forks.

    Trending score: 2.52; stars gained: +25; forks gained: +4.

    Language: Python

    Topics: data-analysis, data-science, data-visualization, deep-learning, developer-tools, machine-learning

  4. ray-project/ray

    Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    GitHub repository with 42,779 stars and 7,642 forks.

    Trending score: 2.37; stars gained: +15; forks gained: +13.

    Language: Python

    Topics: ray, distributed, parallel, machine-learning, reinforcement-learning, deep-learning

  5. SimplifyJobs/Summer2026-Internships

    Summer 2026 software engineering, data science, AI, quant, product management, and hardware internship postings. Updated daily by Simplify and Pitt CSC.

    GitHub repository with 44,807 stars and 3,182 forks.

    Trending score: 2.09; stars gained: +17; forks gained: +1.

    Language: Python

    Topics: data-science, fall-2026, github, internship, internships, interview-preparation

  6. lance-format/lance

    Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch, with more integrations coming.

    GitHub repository with 6,582 stars and 695 forks.

    Trending score: 1.70; stars gained: +5; forks gained: +3.

    Language: Rust

    Topics: apache-arrow, computer-vision, data-analysis, data-analytics, data-centric, data-format