7 repos
Storage Systems — Data & Databases
We curate 7 GitHub repositories matching data & databases · Storage Systems. Refine with filters or upvote what's useful.
Storage Systems — Data & Databases
- torvalds/linux
torvalds/linux
217,986The Linux kernel is a monolithic operating system kernel that serves as the primary interface between computer hardware and software applications. It provides the foundational infrastructure for managing system resources, including memory allocation, process scheduling, and synchronization primitives. The project includes comprehensive support for diverse storage architectures through its filesystem suite and manages complex networking, virtualization, and power management subsystems. Beyond core system management, the kernel offers extensive frameworks for hardware interaction, covering input devices, audio, sensors, and various bus communication protocols. It incorporates diagnostic tools for system observability, security mechanisms for integrity protection, and a kernel-level virtual machine for sandboxed execution. The project maintains stability through defined interface guarantees and supports modular development, including integrated support for memory-safe programming.
C - trimstray/the-book-of-secret-knowledge
trimstray/the-book-of-secret-knowledge
206,980This project serves as a centralized, community-driven repository of technical knowledge and administrative resources. It provides a structured taxonomy that aggregates disparate information into a searchable framework, supporting continuous learning and rapid problem-solving for system administrators and cybersecurity practitioners. By mapping resources across offensive security, infrastructure management, and software development, it offers a unified path for skill acquisition and professional reference. The project is defined by a command-line-first design philosophy, prioritizing terminal-based utilities and scriptable interfaces to facilitate efficient system administration and repeatable security workflows. It distinguishes itself through a platform-agnostic approach, maintaining documentation and operational guides that remain applicable across diverse Unix-like and cloud-based environments. This modular toolchain integration allows users to compose custom environments tailored to specific administrative or security tasks. The repository covers a broad capability surface, including comprehensive toolkits for system auditing, network management, and infrastructure hardening. It provides structured learning paths for cybersecurity skill development, ranging from ethical hacking labs and penetration testing standards to vulnerability assessment and system configuration best practices. The collection also encompasses a wide array of productivity tools, diagnostic utilities, and educational materials designed to streamline routine maintenance and enhance overall security posture.
awesomeawesome-listbsd - Significant-Gravitas/AutoGPT
Significant-Gravitas/AutoGPT
181,891AutoGPT is an orchestration platform designed for building, managing, and deploying autonomous agents. It provides a visual canvas-based environment where users can assemble agents by connecting modular blocks that represent actions, data flows, and conditional logic. The platform supports the entire agent lifecycle, including task scheduling, execution monitoring, and configuration management, while offering a marketplace for discovering and sharing community-built workflows. The project includes a legacy framework for command-line agent execution and an extensible component system for developers to build custom agent capabilities. These tools allow for the integration of various language models, web search utilities, and external services such as database management, productivity platforms, and software development tools. Users can deploy the platform locally using provided installation scripts and containerization utilities or utilize the managed cloud environment.
Pythonaiartificial-intelligenceautonomous-agents - kubernetes/kubernetes
kubernetes/kubernetes
120,673Kubernetes is a distributed container orchestration platform that automates the deployment, scaling, and management of containerized applications across clusters of computing nodes. It functions as a declarative infrastructure controller, utilizing a control loop architecture that continuously monitors the current system state against user-defined configurations to ensure desired operational outcomes. The system relies on a centralized API-driven interface and a replicated key-value store to maintain a consistent source of truth for all cluster objects. The platform distinguishes itself through a highly extensible design that allows users to define domain-specific objects using the same native API and control loop infrastructure. It employs a standardized abstraction layer for container runtimes, enabling modular execution engines, and utilizes a pluggable controller pattern that supports third-party integrations without requiring modifications to the core codebase. An algorithmic bin-packing engine further optimizes hardware utilization by dynamically matching workload requirements with available cluster capacity. Beyond core orchestration, the system provides comprehensive operational support for distributed environments, including automated lifecycle management, horizontal and vertical scaling, and self-healing mechanisms that maintain service availability. It encompasses integrated solutions for networking, persistent storage orchestration, and secure secret management. Diagnostic utilities for monitoring performance metrics, aggregating logs, and troubleshooting infrastructure-level issues are also included to support cluster health and reliability.
Gocncfcontainersgo - immich-app/immich
immich-app/immich
92,953Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining full control over data ownership and storage infrastructure, the platform ensures that users retain sovereignty over their digital assets. The system distinguishes itself through a distributed architecture that coordinates background media synchronization, real-time filesystem monitoring, and automated deduplication. It leverages an integrated machine learning pipeline to perform intelligent asset organization, including facial recognition, object detection, and metadata extraction. These processes are executed through containerized service orchestration, which manages complex dependencies and hardware-accelerated tasks within isolated environments. Beyond core management, the platform provides extensive tools for disaster recovery and library maintenance. Users can configure automated database backups, manage external storage volumes, and define granular synchronization policies for mobile devices. The system also includes command-line utilities for secure remote operations, such as authenticated asset uploading and server version verification, ensuring compatibility and consistency across distributed deployments.
TypeScriptbackup-toolfluttergoogle-photos - macrozheng/mall
macrozheng/mall
82,926This project is an enterprise-grade Java framework designed for building scalable, full-stack e-commerce applications. It provides a comprehensive foundation for microservice-based distributed architectures, enabling the development of complex retail platforms that include product management, order processing, and secure user authentication. By leveraging modular service patterns and centralized API gateways, the framework supports the construction of resilient systems that decompose monolithic business logic into independent, manageable services. The platform distinguishes itself through a robust suite of infrastructure and operational tools that facilitate high-scale deployments. It features integrated support for container-orchestrated environments, event-driven message brokering, and centralized security via token-based authentication. To ensure operational visibility, the framework includes a centralized log aggregation pipeline, real-time health monitoring, and distributed system observability, allowing teams to maintain stability across complex service boundaries. Beyond its core architecture, the platform offers extensive developer tooling and data management capabilities. It supports advanced database operations, including read-write splitting, query routing, and data synchronization, alongside integration with distributed search engines and object storage systems. The development environment is further enhanced by utilities for code quality enforcement, automated entity generation, dependency management, and architectural visualization, providing a complete ecosystem for the lifecycle of enterprise-grade web applications.
Javadockerelasticsearchelk - elastic/elasticsearch
elastic/elasticsearch
76,163Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintaining global cluster state through a distributed consensus mechanism. The platform distinguishes itself through its integrated approach to observability, security, and advanced analytics. It combines full-text, vector, and hybrid search capabilities with machine learning-driven insights, allowing users to perform complex statistical aggregations, geospatial analysis, and automated anomaly detection. Its storage architecture supports multi-tier data lifecycles, enabling efficient data placement across hot, warm, and cold nodes to balance performance with long-term retention requirements. Beyond core search and storage, the system provides comprehensive observability tools for centralized log analysis, application performance monitoring, and infrastructure health diagnostics. It includes built-in security operations for threat detection and endpoint protection, all managed through a unified RESTful API gateway. The system is accessible via standardized REST APIs for cluster management, data ingestion, and query execution. Extensive documentation is available to guide users through API references for search, indexing, security, and cluster administration.
Javaelasticsearchjavasearch-engine