2 repos
Model Evaluation — Artificial Intelligence & Machine Learning
We curate 2 GitHub repositories matching artificial intelligence & machine learning · Model Evaluation. Refine with filters or upvote what's useful.
Model Evaluation — Artificial Intelligence & Machine Learning
- langchain-ai/langchain
langchain-ai/langchain
127,015LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows that manage state, memory, and tool execution. The project distinguishes itself through a durable execution runtime that maintains persistent state across long-running processes by checkpointing progress to external storage. It models agent workflows as directed graphs, allowing for explicit node-to-node routing and state management. Furthermore, it includes a human-in-the-loop control layer that enables developers to pause execution at defined breakpoints, allowing for manual inspection, modification, and approval of agent actions during runtime. Beyond its core orchestration capabilities, the framework supports a tiered memory architecture that separates short-term conversation context from long-term persistent data. It also provides comprehensive observability tools for tracing and monitoring execution flows, alongside security features for managing authentication and fine-grained access control. The platform is supported by extensive documentation and standardized interfaces for models, embeddings, and data sources to facilitate the development of production-grade agentic systems.
Pythonagentsaiai-agents - microsoft/generative-ai-for-beginners
microsoft/generative-ai-for-beginners
106,618This project is a comprehensive, open-source educational curriculum designed to guide developers through the mastery of generative artificial intelligence. It provides a structured learning path that covers foundational concepts, prompt engineering, and the practical application of large language models. The repository serves as a central hub for skill acquisition, offering sequential modules that progress from basic model mechanics to advanced architectural patterns. The curriculum distinguishes itself by focusing on the end-to-end lifecycle of intelligent software, including the implementation of retrieval-augmented generation and agentic workflow orchestration. It provides technical guidance on integrating diverse models—ranging from open-source options to cloud-based services—while emphasizing responsible development through systematic safety guardrails and ethical design practices. Learners are equipped to build functional applications, such as conversational interfaces, semantic search tools, and automated content generators, using standardized interfaces and modern development techniques. Beyond core model implementation, the resource covers operational practices for monitoring and maintaining AI systems in production. It includes practical modules on fine-tuning, vector-based indexing, and designing intuitive user experiences for intelligent systems. The repository is structured to support developers through every stage of the process, from initial environment configuration and dependency management to deployment readiness and troubleshooting.
Jupyter Notebookaiazurechatgpt