使用Python抓取模板之家的CSS模板

Python版本是2.7.9，在win8上测试成功，就是抓取有点慢，本来想用多线程的，有事就罢了。模板之家的网站上的url参数与页数不匹配，懒得去做分析了，就自己改代码中的url吧。大神勿喷！

复制代码代码如下:

          
           #!/usr/bin/env python 
  
           # -*- coding: utf-8 -*- 
  
           # by ustcwq 
  
           # 2015-03-15 
  
           import urllib,urllib2,os,time 
  
           from bs4 import BeautifulSoup 
  
           start = time.clock() 
  
           path = os.getcwd()+u'/模板之家抓取的模板/' 
  
           if not os.path.isdir(path): 
  
               os.mkdir(path) 
  
           url = "http://www.cssmoban.com/cssthemes/index_80.shtml"    # 源网站中的index后面数字怎么编排的？ 
  
           theme_url ='http://www.cssmoban.com/cssthemes/' 
  
           response = urllib2.urlopen(url) 
  
           soup = BeautifulSoup(response) 
  
           result = soup.select('p[class="title"] a') 
  
           print result 
  
           for item in result: 
  
               link = item['href'] 
  
               # down_name = item.text   # 文件名称 
  
               new_url = theme_url+link.split('/')[-1] 
  
               response = urllib2.urlopen(new_url) 
  
               soup = BeautifulSoup(response) 
  
               result = soup.select('.btn a') 
  
               down_url = result[1]['href']    # 文件链接 
  
               local = path+time.strftime('%Y%m%d%H%M%S',time.localtime(time.time()))+'.zip' 
  
               urllib.urlretrieve(down_url, local) # 远程保存函数 
  
           end = time.clock() 
  
           print u'模板抓取完成！' 
  
           print u'一共用时：',end-start,u'秒'

以上所述就是本文的全部内容了，希望大家能够喜欢。

更多文章、技术交流、商务合作、联系博主

微信扫码或搜索：z360901061

微信扫一扫加我为好友

QQ号联系： 360901061

您的支持是博主写作最大的动力，如果您喜欢我的文章，感觉我的文章对您有帮助，请用微信扫描下面二维码支持博主2元、5元、10元、20元等您想捐的金额吧，狠狠点击下面给点支持吧，站长非常感激您！手机微信长按不能支付解决办法：请将微信支付二维码保存到相册，切换到微信，然后点击微信右上角扫一扫功能，选择支付二维码完成支付。

【本文对您有帮助就好】元

2元

5元

10元

20元

自定义