这个一直没通过?今天才看得beautifulsoup,不熟。。。
def _get_new_data(self, page_url, soup): res_data = {} # Url res_data['url'] = page_url print 'now page_url add in res_data !' ''' All Pins
(Pinterest) (Pinterest) #mg_node = soup.find('img', id=re.compile(r"^i-.+")).find(" ", src=re.compile(r"https://s-media-cache-ak0.pinimg.com/\d.+?")) ''' img_node = soup.find(re.compile(r"https://s-media-cache-ak0.pinimg.com/\d.+?")) # Add the img in res_data res_data["img"] = img_node print 'now img_node add in res_data !' ''' great piece for the living room or bedroom. I love the...
<div class="pinMetaWrapper"> ''' summary_node = soup.find('div', class_="pinMetaWrapper") # Add the summary in res_data res_data["summary"] = summary_node.get_text() print 'now summary_node add in res_data !' return res_data |
免责声明:本内容仅代表回答会员见解不代表天盟观点,请谨慎对待。
版权声明:作者保留权利,不代表天盟立场。
|
|
|
|