Skip to content

Latest commit

 

History

History
26 lines (23 loc) · 835 Bytes

archive-rmrb.md

File metadata and controls

26 lines (23 loc) · 835 Bytes

Purpose

As a Chinese citizen, you may wanna to archive the daily paper and glance throungh them years later, which may reveal many things

a simple one

wget -r http://paper.people.com.cn/rmrb/ -A .pdf

The actual days in month and pages are smaller than what I've written here, just in case we miss any page

for year in 2018 2019 
do
  for month in 01 02 03 04 05 06 07 08 09 10 11 12
   do
        for day in 0{1..9} {10..31} ; do
         for page in  0{1..9} {10..31} ; do
            echo -ne $year$month$day$page\\r ;  #show it in the same line
            # only get the newer one,skip what we already get
            # output log to /dev/null
            wget -N paper.people.com.cn/rmrb/page/$year-$month/$day/$page/rmrb$year$month$day$page.pdf -o /dev/null;    
          done
       done
   done
done