Brazilian Presidential Transition (2018) Web Archive collection derivatives

Nick Ruest, Talía Guzman-González, Sócrates Silva, Jill Baron & Samantha Abrams
Web archive derivatives of the Brazilian Presidential Transition (2018) Web Archive collection collection from the Ivy Plus Libraries Confederation. The derivatives were created with the Archives Unleashed Toolkit and Archives Unleashed Cloud. The ivy-11549-parquet.tar.gz derivatives are in the Apache Parquet format, which is a columnar storage format. These derivatives are generally small enough to work with on your local machine, and can be easily converted to Pandas DataFrames. See this notebook for examples. Domains .webpages().groupBy(ExtractDomainDF($"url").alias("url")).count().sort($"count".desc)...
This data repository is not currently reporting usage information. For information on how your repository can submit usage information, please see our documentation.