2 minute read

Wayback Machine home page

8 captures

27 Nov 2020 - 07 Oct 2024

     
Apr OCT Nov
23:32:10 Apr 18, 2024 07  
2023 2024 2025

success

fail Share via My Web Archive Sign InGet some help using the Wayback MachineClose the toolbar

screenshotvideoShare on FacebookShare on Twitter

About this capture

COLLECTED BY

Collection: Common Crawl

Web crawl data from Common Crawl.

TIMESTAMPS

The Wayback Machine - https://web.archive.org/web/20241007111148/https://blog.tidelift.com/our-second-libraries.io-open-data-release-has-arrived

RSVP! Top findings from the 2024 Tidelift state of the open source maintainer report 📊

Our second Libraries.io open data release has arrived

Tidelift

by Tidelift

on November 30, 2017

Updated on March 14, 2018

x

Don’t miss the latest from Tidelift

Today we’re publishing another Libraries.io open data release with over 311 million rows of metadata about open source projects and the network of dependency data that connects them all.

Six months ago we published our first open data as part of our commitment to the Alfred P. Sloan and Ford Foundations. The data supports academics looking into trends in software development, investors to understand the success of projects they support, and developers to understand how their software is used more effectively than ever before.

Last week we announced that Libraries.io has joined forces with Tidelift to make open source software work better for developers and users. Libraries.io’s mission hasn’t changed and we’re going to continue publishing open data releases every quarter to build a stronger, more informed open source ecosystem.

Since our last release the Libraries.io dataset has grown significantly, today we’re releasing data on:

  • 34 package managers

  • 2.7 million projects

  • 11 million versions

  • 66 million project dependencies

  • 31 million repositories

  • 161 million repository dependencies

  • 10 million manifest files

  • 46 million git tags

The data is available in its raw format on Zenodo and we’re working on getting it published as a structured, queryable dataset on Google’s BigQuery. If you’d like to build tools on top of the most recent data, or top up your dataset to keep it current, check out the Libraries.io REST API.

For further documentation, check out our dedicated open data page. Also check out the article Ben wrote for opensource.com to get more ideas of things you can do with the data.

This data is published under a Creative Commons BY-SA-4.0 licence. It’s an open and free licence that commits the user to redistributing their work, and their understanding. And don’t forget, Libraries.io is open source, so if you’d like to get involved we can help you get started—check out the Contributors Handbook:

Finally, if you’d like regular updates from Tidelift on news like this, sign up here.

Data, Libraries.io, Dependencies, Licensing, Package Managers

You might also like:

Data, Libraries.io, Dependencies, Licensing, Package Managers
Product update: Prioritize the most impactful work with contextualized end-of-life package and version insights

Data, Libraries.io, Dependencies, Licensing, Package Managers
Product update: Using end-of-life package data to identify and eliminate bad open source packages

Updated: