Skip to content
This repository has been archived by the owner on Feb 29, 2020. It is now read-only.

download python package git repos #1

Open
tbonza opened this issue Jun 5, 2019 · 1 comment
Open

download python package git repos #1

tbonza opened this issue Jun 5, 2019 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@tbonza
Copy link
Contributor

tbonza commented Jun 5, 2019

No description provided.

@tbonza tbonza self-assigned this Jun 5, 2019
@tbonza tbonza added the enhancement New feature or request label Jun 5, 2019
@tbonza
Copy link
Contributor Author

tbonza commented Jun 16, 2019

We want to first prioritize by collecting python packages. When I review pypi.org/robots.txt

  1. Parse the PyPi sitemap.xml to get all the Python package names on PyPI
  2. Use python package ids from previous step to make API calls to PyPI (see pypi api reference docs)
  3. Store API results as JSON files for now with project name and datetime as file name so okra-20190616.json

We may be able to make parallel calls to the API by using Kubernetes Jobs

treble37 added a commit that referenced this issue Jun 25, 2019
Setup with new golang modules and golang 1.12

Install guide here: https://gist.github.com/rubencaro/5ce32fb30bbfa70e7db6be14cf42a35c

Also, download json files from pypi; addresses #1
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant