Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing files in 2012 #30

Closed
rolfvandekrol opened this issue Apr 12, 2013 · 13 comments
Closed

Missing files in 2012 #30

rolfvandekrol opened this issue Apr 12, 2013 · 13 comments

Comments

@rolfvandekrol
Copy link

I'm trying to download the full archive for 2013, and saw that these files result in 404.

http://data.githubarchive.org/2012-03-01-8.json.gz
http://data.githubarchive.org/2012-03-05-4.json.gz
http://data.githubarchive.org/2012-03-05-5.json.gz
http://data.githubarchive.org/2012-03-05-6.json.gz
http://data.githubarchive.org/2012-03-05-7.json.gz
http://data.githubarchive.org/2012-03-10-9.json.gz
http://data.githubarchive.org/2012-03-10-10.json.gz
http://data.githubarchive.org/2012-03-10-11.json.gz
http://data.githubarchive.org/2012-03-10-12.json.gz
http://data.githubarchive.org/2012-03-10-13.json.gz
http://data.githubarchive.org/2012-03-10-14.json.gz
http://data.githubarchive.org/2012-03-10-15.json.gz
http://data.githubarchive.org/2012-03-10-16.json.gz
http://data.githubarchive.org/2012-03-10-17.json.gz
http://data.githubarchive.org/2012-03-10-18.json.gz
http://data.githubarchive.org/2012-03-10-19.json.gz
http://data.githubarchive.org/2012-03-10-20.json.gz
http://data.githubarchive.org/2012-03-10-21.json.gz
http://data.githubarchive.org/2012-03-11-2.json.gz
@igrigorik
Copy link
Owner

Hmm. Double checked my raw archives, and indeed those are missing. Will see if I can fill it in, but unfortunately this may be non-recoverable..

@rolfvandekrol
Copy link
Author

Hmm, that would be a shame. We'll see.

@igrigorik
Copy link
Owner

Particular reason why you need those specific hours / days? :)

@rolfvandekrol
Copy link
Author

Was playing with the data for the github data challenge. But probably won't
miss those few hours. Just noticed they were missing.
On Apr 13, 2013 7:44 PM, "Ilya Grigorik" notifications@github.com wrote:

Particular reason why you need those specific hours / days? :)


Reply to this email directly or view it on GitHubhttps://github.com//issues/30#issuecomment-16337452
.

@igrigorik
Copy link
Owner

Gotcha - thanks for the heads up.

@davidfischer
Copy link

Seems to be something interesting on March 5. Something similar applies to 2013.

http://data.githubarchive.org/2013-03-05-20.json.gz

That archive is missing as well.

@karan
Copy link

karan commented Jul 6, 2013

Same for April 2012 (the example is readme)

http://data.githubarchive.org/2012-04-{01..31}-{0..23}.json.gz

@igrigorik
Copy link
Owner

@thekarangoel any files in particular? seems to work for me. Be careful with bash expansions.. in older versino of bash 01..31 doesn't work (returns 1,2,3.. instead of 01,02,03..)

@karan
Copy link

karan commented Jul 6, 2013

Yeah that seems to work. One question, probably a naive one, how do I download all of the archive for 2012 and 2013?

@igrigorik
Copy link
Owner

http://data.githubarchive.org/201{2,3}-{01..12}-{01..31}-{0..23}.json.gz :)

@karan
Copy link

karan commented Jul 6, 2013

Amazing. Just want to build something cool with the data. :)

@ycha28
Copy link

ycha28 commented Jul 14, 2013

Two of the three http addresses listed in the readme are not working for me. Here's the error that I see when I type "http://data.githubarchive.org/2012-04-11-{0..23}.json.gz" into my browser:

This XML file does not appear to have any style information associated with it. The document tree is shown below.

NoSuchKey
The specified key does not exist.

@karan
Copy link

karan commented Jul 14, 2013

The "..." in the URL means you need to fill in all the numbers. You can use the "..." format in the terminal though with wget command.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants