3 repos
Data Science — Data & Databases
We curate 3 GitHub repositories matching data & databases · Data Science. Refine with filters or upvote what's useful.
Data Science — Data & Databases
- sindresorhus/awesome
sindresorhus/awesome
438,690This project is a community-curated knowledge base that organizes vast technical ecosystems into a hierarchical, human-readable directory. It serves as a comprehensive index of libraries, frameworks, and methodologies, designed to facilitate discovery and professional development across the entire spectrum of software engineering and computer science. The directory distinguishes itself through a decentralized, peer-review model where the taxonomy evolves collaboratively via standard version-control workflows. By utilizing a markdown-based, flat-file structure, the project ensures that its curated knowledge remains platform-agnostic, accessible, and easily maintainable by the community. The repository covers a broad capability surface, including back-end and front-end development, data science, decentralized systems, and security practices. It also provides extensive educational resources, such as structured learning roadmaps, professional development guides, and specialized indexes for programming languages, hardware, and game development. The entire knowledge base is maintained as a version-controlled repository, allowing for continuous refinement and integration of new technical resources through community-driven pull requests.
awesomeawesome-listlists - jackfrued/Python-100-Days
jackfrued/Python-100-Days
178,734This project is a comprehensive, day-by-day curriculum designed to guide learners through the Python programming language and its professional applications. The content spans from fundamental syntax and object-oriented design to advanced topics including database management, web development, data analysis, and machine learning. The curriculum is structured into distinct modules that cover practical software engineering practices, such as version control, containerization, and system architecture. It also provides resources for technical interview preparation and an analysis of career paths within the software development and data science ecosystems. The material is delivered through a series of structured lessons and practical exercises.
Jupyter Notebook - awesomedata/awesome-public-datasets
awesomedata/awesome-public-datasets
72,846This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, the repository facilitates the discovery of data necessary for exploratory analysis, machine learning model training, and the development of data-intensive applications. The directory distinguishes itself through a lightweight, platform-agnostic approach to resource indexing that avoids the need for complex backend infrastructure. Content is organized using a topic-centric hierarchical taxonomy, which simplifies navigation across diverse domains ranging from climate science and economics to healthcare and computer networks. This structure is maintained through a collaborative, community-driven model where peer review and version-controlled updates ensure the ongoing accuracy and relevance of the curated links. The collection covers a broad capability surface, including specialized datasets for fields such as physics, geographic information systems, natural language processing, and time-series analysis. The repository is documented entirely through human-readable markdown files, allowing for transparent contributions and easy access to its comprehensive index of public information.
aaron-swartzawesome-public-datasetsdatasets