前些时间论坛里一位朋友分享了关键词采集源里,觉得这个百度商情还是不错;在这里要特别感谢一下那位朋友、感谢三博
#coding=utf-8
import urllib2,sys,MySQLdb,time
import json
reload(sys)
sys.setdefaultencoding('utf-8')
conn=MySQLdb.connect(host="localhost",user="root",passwd="",db="jianshen_keyword",charset="utf8") #连接数据库
cursor=conn.cursor()
cursor.execute("SET NAMES utf8") #防止乱码
html = urllib2.urlopen('http://shangqing.baidu.com/recomword/recomWordCache_findRecomWord.htm?area_id=&word=肱三头肌').read()
d = json.loads(html)
for item in d["data"]["list"]:
dates = item["word"]
#print dates
url = 'http://shangqing.baidu.com/recomword/recomWordCache_findRecomWord.htm?area_id=&word=%s'%dates
print url
try:
html2 = urllib2.urlopen(url).read()
s = json.loads(html2)
for item1 in s["data"]["list"]:
print item1["word"]
print item1["total"]
cursor.execute("insert into gongsantouji(keyword,total) values('%s','%s')" %(item1["word"],item1["total"]))
time.sleep(0.3)
except :
continue
time.sleep(5)
有什么问题及时留言; |
评分
-
查看全部评分
发表于 2014-6-20 13:56:31
|只看大图
|