3 repos
Observability Architectures & Standards — System Administration & Monitoring
We curate 3 GitHub repositories matching system administration & monitoring · Observability Architectures & Standards. Refine with filters or upvote what's useful.
Observability Architectures & Standards — System Administration & Monitoring
- langchain-ai/langchain
langchain-ai/langchain
127,015LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows that manage state, memory, and tool execution. The project distinguishes itself through a durable execution runtime that maintains persistent state across long-running processes by checkpointing progress to external storage. It models agent workflows as directed graphs, allowing for explicit node-to-node routing and state management. Furthermore, it includes a human-in-the-loop control layer that enables developers to pause execution at defined breakpoints, allowing for manual inspection, modification, and approval of agent actions during runtime. Beyond its core orchestration capabilities, the framework supports a tiered memory architecture that separates short-term conversation context from long-term persistent data. It also provides comprehensive observability tools for tracing and monitoring execution flows, alongside security features for managing authentication and fine-grained access control. The platform is supported by extensive documentation and standardized interfaces for models, embeddings, and data sources to facilitate the development of production-grade agentic systems.
Pythonagentsaiai-agents - modelcontextprotocol/servers
modelcontextprotocol/servers
79,000The Model Context Protocol is a standardized communication framework designed to connect language models to external data sources, functional tools, and interactive user interfaces. It provides a vendor-neutral interface layer that enables AI hosts to discover and execute capabilities across heterogeneous service environments, using a JSON-RPC based messaging standard to facilitate bidirectional communication between clients and servers. The protocol distinguishes itself through a robust capability-based handshake that negotiates feature sets during session initialization, ensuring compatibility and supporting graceful degradation when client and server capabilities are mismatched. It enforces security through a mediation framework that manages isolated connections, implements least-privilege access controls, and provides standardized authorization flows. By executing server instances as independent, host-managed processes, the protocol maintains strict security boundaries while allowing for modular growth through a defined lifecycle for protocol extensions. Beyond its core messaging and security primitives, the protocol covers a broad range of integration needs, including structured resource access, schema-defined tool invocation, and parameterized prompt templates. It supports advanced interaction patterns such as asynchronous task management with durable handles, interactive UI rendering, and dynamic user input elicitation. The ecosystem also includes developer tooling for session management, server metadata discovery, and diagnostic inspection to assist in the integration of local and remote services.
TypeScript - netdata/netdata
netdata/netdata
77,812Netdata is a distributed observability platform designed for real-time infrastructure monitoring and performance tracking. It functions as a high-frequency agent that collects system, container, and application metrics with per-second precision, providing both local visualization and centralized aggregation across complex, multi-cloud environments. The platform distinguishes itself through edge-based intelligence, utilizing local machine learning models to automatically detect performance anomalies without requiring manual configuration or external query engines. Its architecture prioritizes local-first data persistence and secure metadata-only synchronization, ensuring that granular observability data remains on the host while essential system information is routed to a cloud-connected management plane. This hierarchical approach allows for horizontal scaling through parent-child node relationships, enabling unified monitoring and alerting across distributed infrastructure. Beyond core collection and analysis, the system supports automated troubleshooting through natural language querying and intelligent metric correlation. It features a modular data acquisition engine that employs thread-per-core execution for low-latency performance, alongside isolated external processes for heterogeneous application support. The platform includes automated service discovery, diverse deployment options, and built-in diagnostic utilities to maintain visibility and connectivity across large-scale clusters. Installation is supported through various methods including package managers, automated scripts, source compilation, and containerized orchestration.
Caialertingcncf