Latest Updates @ https://sskaje.me/cloudera-mirror/
Cloudera does not provide any mirror sites for downloading its parcels and packages, but fortunately directory listing is enabled on archive.cloudera.com, so I just created my own!
You can download the CM installer from http://cloudera.rst.im/cm4/installer/latest/
And use http://cloudera.rst.im/cm4/redhat/6/x86_64/cm/4.7.0/ as your yum repo for CM 4.7.0. (No 'latest' stuff this time :P)
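If you have not set up a yum repo by hand before, here is a minimal sketch of a repo definition for the URL above. The helper name `write_cm_repo`, the repo id `cloudera-manager`, and the file name are my own picks for illustration, and `gpgcheck=0` is an assumption on my side because the mirror may not carry Cloudera's GPG key; turn it back on if you import the key yourself.

```shell
#!/bin/sh
# Hypothetical helper: writes a yum repo definition that points at the mirror.
# The target path can be passed as $1; the default location is an assumption.
write_cm_repo() {
    repo_file="${1:-/etc/yum.repos.d/cloudera-manager.repo}"
    cat > "$repo_file" <<'EOF'
[cloudera-manager]
name=Cloudera Manager 4.7.0 (mirror)
baseurl=http://cloudera.rst.im/cm4/redhat/6/x86_64/cm/4.7.0/
gpgcheck=0
enabled=1
EOF
}
```

Call `write_cm_repo` as root (or pass another path to inspect the file first), then refresh the yum metadata as usual.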
Here are the *latest* parcels:
http://cloudera.rst.im/cdh4/parcels/latest/
http://cloudera.rst.im/search/parcels/latest/
http://cloudera.rst.im/sentry/parcels/latest/
http://cloudera.rst.im/impala/parcels/latest/
http://cloudera.rst.im/gplextras/parcels/latest/
You can add the links above under CM -> Administration -> Settings -> Parcels -> Remote Parcel Repository URLs.
Also, rsync is supported. Just try:
    rsync rsync://cloudera.rst.im/cloudera/
Script for mirroring:
    #!/bin/sh
    # CM 4.7.0 yum repo (RHEL/CentOS 6, x86_64)
    wget -crm http://archive.cloudera.com/cm4/redhat/6/x86_64/cm/4.7.0/ --accept-regex='4\.7\.0' --reject-regex='index\.html'
    # Latest el6 parcels; manifest.json does not match the el6 regex, so fetch it separately
    wget -crm http://archive.cloudera.com/cdh4/parcels/latest/ --accept-regex='latest/.*el6.*'
    wget -crm http://archive.cloudera.com/cdh4/parcels/latest/manifest.json
    wget -crm http://archive.cloudera.com/search/parcels/latest/ --accept-regex='latest/.*el6.*'
    wget -crm http://archive.cloudera.com/search/parcels/latest/manifest.json
    wget -crm http://archive.cloudera.com/impala/parcels/latest/ --accept-regex='latest/.*el6.*'
    wget -crm http://archive.cloudera.com/impala/parcels/latest/manifest.json
    wget -crm http://archive.cloudera.com/sentry/parcels/latest/ --accept-regex='latest/.*el6.*'
    wget -crm http://archive.cloudera.com/sentry/parcels/latest/manifest.json
    wget -crm http://archive.cloudera.com/gplextras/parcels/latest/ --accept-regex='latest/.*el6.*'
    wget -crm http://archive.cloudera.com/gplextras/parcels/latest/manifest.json
    # Remove the directory index pages that wget leaves behind
    find archive.cloudera.com -name "index.html*" -type f -exec rm -f {} \;
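The five parcel stanzas in the script differ only in the product name, so they could be folded into a loop. This is just a sketch of that idea, not the script the mirror actually runs: the function name `mirror_parcels` and the `WGET` override (handy for a dry run with `WGET=echo`) are assumptions of mine.

```shell
#!/bin/sh
# WGET may be overridden, e.g. WGET=echo, to print the commands instead of running them.
WGET="${WGET:-wget}"
BASE="http://archive.cloudera.com"

# Hypothetical helper: mirror the latest el6 parcels plus manifest.json
# for every product name given as an argument.
mirror_parcels() {
    for product in "$@"; do
        url="$BASE/$product/parcels/latest/"
        $WGET -crm "$url" --accept-regex='latest/.*el6.*'
        $WGET -crm "${url}manifest.json"
    done
}
```

It would be called as `mirror_parcels cdh4 search impala sentry gplextras`, followed by the same `find ... index.html*` cleanup as before.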
The script will be updated someday to add an md5 check on manifest.json, so that outdated parcels can be deleted before synchronizing from Cloudera.
But, yes, someday.
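In the meantime, here is a rough sketch of how such a check might work. It is a guess at the idea, not the real script: both helper names are hypothetical, it assumes `md5sum` from GNU coreutils is available, and it assumes that every file matching `*.parcel*` in a directory can be thrown away once the manifest has changed.

```shell
#!/bin/sh
# Hypothetical helper: succeeds (returns 0) only when both manifests exist
# and their md5 sums differ. With no local manifest there is nothing to compare.
manifest_differs() {
    old_manifest="$1"
    new_manifest="$2"
    [ -f "$old_manifest" ] && [ -f "$new_manifest" ] || return 1
    old_sum=$(md5sum < "$old_manifest" | cut -d' ' -f1)
    new_sum=$(md5sum < "$new_manifest" | cut -d' ' -f1)
    [ "$old_sum" != "$new_sum" ]
}

# Hypothetical helper: delete mirrored parcel files in $1 when the freshly
# downloaded manifest $2 differs from the local $1/manifest.json.
purge_if_outdated() {
    parcel_dir="$1"
    fresh_manifest="$2"
    [ -d "$parcel_dir" ] || return 0
    if manifest_differs "$parcel_dir/manifest.json" "$fresh_manifest"; then
        find "$parcel_dir" -name '*.parcel*' -type f -exec rm -f {} \;
    fi
}
```

The idea would be to download the upstream manifest.json to a temporary file first, run `purge_if_outdated archive.cloudera.com/cdh4/parcels/latest /path/to/fresh-manifest.json` for each parcel directory, and only then start the wget run.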