9 repos
Machine Learning Infrastructure — Artificial Intelligence & Machine Learning
We curate 9 GitHub repositories matching artificial intelligence & machine learning · Machine Learning Infrastructure. Refine with filters or upvote what's useful.
Machine Learning Infrastructure — Artificial Intelligence & Machine Learning
- tensorflow/tensorflow
tensorflow/tensorflow
193,864TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The system provides high-level interfaces for defining neural network architectures, alongside a robust engine for managing multidimensional array structures and tensor mathematics. The framework distinguishes itself through a scalable distributed runtime that orchestrates workloads across heterogeneous hardware accelerators and decentralized network nodes. It employs deferred-execution symbolic graphs to perform graph-level optimizations, fusion, and ahead-of-time kernel compilation for specific hardware architectures. To ensure consistent performance across production environments, it features a standardized serialization format for model graphs and specialized tools for model serving, quantization, and compression. Beyond core training capabilities, the platform includes a high-throughput data ingestion engine that supports asynchronous, multi-threaded pipelines to prevent bottlenecks. It also offers extensive support for hardware abstraction, allowing for pluggable device integration and containerized acceleration. The ecosystem is rounded out by utilities for data validation, federated learning, and specialized modeling tasks, providing a complete toolchain for moving models from research into high-availability production environments.
C++deep-learningdeep-neural-networksdistributed - huggingface/transformers
huggingface/transformers
156,730Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering specialized architectures for both text and vision processing. The framework includes tools for managing the entire model lifecycle, from data preprocessing and tokenization to distributed training and inference. The library features extensive support for model optimization and performance, including techniques like quantization, speculative decoding, and paged memory management for key-value caches. It provides native integration for distributed training across multi-node clusters, as well as flexible APIs for serving models via compatible inference servers. Developers can also utilize built-in utilities for model patching, custom kernel execution, and automated documentation generation to streamline development workflows.
Pythonaudiodeep-learningdeepseek - immich-app/immich
immich-app/immich
92,953Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining full control over data ownership and storage infrastructure, the platform ensures that users retain sovereignty over their digital assets. The system distinguishes itself through a distributed architecture that coordinates background media synchronization, real-time filesystem monitoring, and automated deduplication. It leverages an integrated machine learning pipeline to perform intelligent asset organization, including facial recognition, object detection, and metadata extraction. These processes are executed through containerized service orchestration, which manages complex dependencies and hardware-accelerated tasks within isolated environments. Beyond core management, the platform provides extensive tools for disaster recovery and library maintenance. Users can configure automated database backups, manage external storage volumes, and define granular synchronization policies for mobile devices. The system also includes command-line utilities for secure remote operations, such as authenticated asset uploading and server version verification, ensuring compatibility and consistency across distributed deployments.
TypeScriptbackup-toolfluttergoogle-photos - hacksider/Deep-Live-Cam
hacksider/Deep-Live-Cam
79,568Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a high-performance processing pipeline, the application enables live face swapping and interactive video modifications during active streaming sessions or on pre-recorded media. The system distinguishes itself through a hardware-abstraction execution layer that dynamically routes compute tasks to available graphics hardware, such as CUDA or CoreML backends. This architecture supports complex operations like multi-face mapping, where distinct target faces are applied to multiple subjects simultaneously, and preserves original mouth movements to maintain natural speech synchronization. To ensure visual fidelity, the engine employs precision mask-based blending and generative detail restoration, effectively integrating source features into target video geometry. Beyond core transformation capabilities, the application includes tools for cinematic rendering, such as real-time color grading and frame interpolation. It manages system resources through chunked memory and frame-based stream processing, which prevents crashes during intensive workloads and maintains stable performance. The interface is designed for focused workflows, offering distraction-free modes and automated projection window management to streamline the user experience during live operations.
Pythonaiai-deep-fakeai-face - tensorflow/models
tensorflow/models
77,684This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines for tasks ranging from image classification and object detection to complex sequence modeling. The project distinguishes itself by providing a flexible execution harness that manages the entire training lifecycle, including data ingestion and backpropagation. It supports scalable training across distributed hardware environments through collective communication primitives and utilizes configuration-driven experimentation to decouple hyperparameters from source code. By structuring neural architectures through hierarchical class compositions and employing checkpoint-based state persistence, the repository ensures that research workflows remain modular, reproducible, and fault-tolerant. These implementations demonstrate industry-standard patterns for constructing and deploying neural networks, including optimized graph-based execution for hardware acceleration. The repository functions as a reference for best practices in deep learning, providing documented examples for vision, language, and training loop management.
Python - redis/redis
redis/redis
73,096Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to process requests efficiently, while maintaining data durability through append-only persistence logs and asynchronous snapshotting mechanisms. What distinguishes Redis is its ability to handle complex data structures—including strings, hashes, lists, sets, and sorted sets—alongside hierarchical JSON documents and high-dimensional vector embeddings. It supports advanced operational patterns such as active-active database deployment for global distribution, real-time data streaming, and probabilistic statistics for large-scale data analysis. These capabilities are complemented by a pluggable indexing engine that enables semantic similarity matching and full-text retrieval. The platform offers a comprehensive ecosystem for managing distributed state, including master-replica replication, automated cluster management, and granular security controls like access control lists and TLS encryption. Developers can interact with the database through language-specific client libraries that support connection multiplexing and object mapping, or via a command-line interface for direct administrative tasks and scripting. Redis is deployed through standard package managers and supports both self-managed clusters and managed cloud instances. Observability is provided through integrated tools for performance analysis, slow log monitoring, and bulk data management.
Ccachecachingdatabase - twitter/the-algorithm
twitter/the-algorithm
72,764The algorithm is a distributed recommendation engine pipeline designed to construct and serve personalized content timelines. It functions as a multi-stage orchestration layer that aggregates candidate content from diverse social graphs and high-dimensional embedding spaces, processing user interaction data to deliver a unified, ranked experience. The system utilizes a high-performance machine learning serving infrastructure to execute deep learning models that predict engagement probabilities in real-time. It distinguishes itself through a hybrid retrieval strategy that combines graph-traversal techniques for discovering content outside of a user's immediate network with vector-based similarity searches to identify relevant interests. Beyond core ranking, the platform incorporates a post-ranking processing layer that applies heuristic filters to ensure content diversity, visibility preferences, and social quality safeguards. This architecture also supports multi-task learning to optimize relevance across various platform surfaces, including the integration of non-content items and personalized notifications.
Scala - CompVis/stable-diffusion
CompVis/stable-diffusion
72,380Stable Diffusion is a generative machine learning pipeline that synthesizes high-resolution visual content by performing iterative denoising within a compressed latent space. By mapping natural language embeddings into pixel outputs through conditioned probabilistic processes, the framework enables the generation of images from text prompts and the transformation of existing visual inputs based on semantic instructions. The architecture utilizes a modular execution environment that decouples model loading, scheduler logic, and inference components to support diverse hardware configurations. It distinguishes itself through a symmetric encoder-decoder backbone that preserves spatial information during refinement, alongside integrated safety filters and invisible watermarking for generated outputs. The system provides a comprehensive suite of tools for latent space generative modeling, including capabilities for inpainting, outpainting, and style transfer. These functions are exposed through standardized interfaces, allowing for the integration of advanced diffusion-based inference into broader software workflows.
Jupyter Notebook - tensorflow/tfjs-examples
tensorflow/tfjs-examples
6,783This repository provides a collection of practical demonstrations and implementation guides for machine learning tasks using TensorFlow.js. It serves as a resource for developers to explore model architectures, training workflows, and data manipulation techniques across domains such as computer vision, natural language processing, and reinforcement learning. The project covers the full lifecycle of machine learning development, including tensor-based mathematical operations, model construction via high-level layer APIs or low-level tensor logic, and model serialization for various storage mediums. It includes utilities for converting models into browser-compatible formats and provides infrastructure for executing these models across diverse backends, including WebGL, WebAssembly, and CPU-accelerated environments. Documentation and examples are organized by task type, allowing users to browse implementations for regression, object detection, and generative models. The repository also includes deployment guides for hosting server-side applications on cloud platforms, alongside tools for managing tensor memory and asynchronous training processes.
JavaScript