狠狠操夜夜甜|人妻在线中文字幕亚洲无码不卡av|一区二区欧美亚洲|日躁夜躁狠狠躁2001|亚洲,超碰,欧美|18AV成人电影|午夜成人免费在线|婷婷激情网深爱五月|色欲综合成人在线|在线美女搞黄大片

<ol id="66111"><center id="66111"></center></ol>

<address id="66111"><ruby id="66111"><delect id="66111"></delect></ruby></address>

<rp id="66111"><table id="66111"><track id="66111"></track></table></rp>

<blockquote id="66111"></blockquote>

<abbr id="66111"><table id="66111"><optgroup id="66111"></optgroup></table></abbr>

登錄注冊(cè)

做網(wǎng)站
做推廣
做電商

資訊熱點(diǎn)

服務(wù)支持
應(yīng)用與服務(wù)
關(guān)于中企

中企商學(xué)院

查看更多

中企動(dòng)力 > 頭條 > 智聯(lián)招聘招聘信息

網(wǎng)站性能檢測(cè)評(píng)分

注：本網(wǎng)站頁面html檢測(cè)工具掃描網(wǎng)站中存在的基本問題，僅供參考。

免費(fèi)檢測(cè)

智聯(lián)招聘招聘信息

編程新手，使用python爬取智聯(lián)招聘職位信息新手視頻課程

冬寒

作為python菜鳥，突如其來地想爬取智聯(lián)招聘的招聘信息，本來是想爬取職位介紹提取關(guān)鍵字做數(shù)據(jù)分析的，然而智聯(lián)的html結(jié)構(gòu)太混亂，只得放棄這個(gè)想法，先爬取了基本的職位信息存儲(chǔ)到本機(jī)的mysql數(shù)據(jù)庫中。純新手一枚，寫得相當(dāng)粗糙，湊合著看吧，具體代碼如下：

mport requests

import urllib.parse

import re

from lxml import etree

import threading

unity_url = r'http://sou.zhaopin/jobs/searchresult.ashx?jl={location}&kw={job}&sm=0&p={page}&source=0'

headers = {"User-Agent":"Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 Safari/537.36"}

def get_info_url():

info_urls = []

location = input("請(qǐng)輸入想要搜索的城市：")

loc = urllib.parse.quote(location.encode('utf-8'))

job = input("請(qǐng)輸入想要搜索的職位：")

job = urllib.parse.quote(job.encode('utf-8'))

num = input("請(qǐng)輸入想要獲取的頁數(shù)：")

for i in range(1,int(num)+1):

url = unity_url.format(location=loc,job=job,page=str(i))

page = requests.get(url, headers=headers)

page.encoding = 'utf-8'

html = page.text

r = repile('need_urls = r.findall(html)

for n_url in need_urls:

info_urls.append(n_url[1])

return info_urls

def get_infos():

info_urls = get_info_url()

jobs = []

companies = []

work_years = []

degrees = []

salarys = []

places = []

for info_url in info_urls:

print (info_url)

info_page = requests.get(info_url, headers=headers)

info_page.encoding = 'utf-8'

info_html = info_page.text

e_html = etree.HTML(info_html)

try:

job = e_html.xpath('/html/body/p[5]/p[1]/p[1]/h1')[0].text

jobs.append(job)

company = e_html.xpath('/html/body/p[5]/p[1]/p[1]/h2/a')[0].text

companies.append(company)

work_year = e_html.xpath('/html/body/p[6]/p[1]/ul/li[5]/strong')[0].text

work_years.append(work_year)

degree = e_html.xpath('/html/body/p[6]/p[1]/ul/li[6]/strong')[0].text

degrees.append(degree)

salary = e_html.xpath('/html/body/p[6]/p[1]/ul/li[1]/strong')[0].text

salarys.append(salary.split('元')[0])

place = e_html.xpath('/html/body/p[6]/p[1]/ul/li[2]/strong/a')[0].text

places.append(place)

except:

pass

return jobs, companies, work_years, degrees, salarys, places

if __name__ == '__main__':

t = threading.Thread(target=get_infos)

t.start()

此代碼只是獲取了一些職位信息的URL后寫入列表中，后續(xù)還有寫入數(shù)據(jù)庫，提取數(shù)據(jù)做數(shù)據(jù)可視化的例子

后面數(shù)據(jù)可視化的例子

400-660-5555 在線客服

最新推薦

在家賺錢小項(xiàng)目

合理避個(gè)人所得稅稅

在線網(wǎng)站制作

合理避開企業(yè)所得稅

合理避免企業(yè)所得稅

怎么開發(fā)游戲軟件

合理避免所得稅

怎么可以在家掙錢

合理避企業(yè)所得稅

會(huì)計(jì)培訓(xùn)學(xué)校

7 x 24

全國售后支持
100 倍

故障時(shí)長賠付
26 年

26年行業(yè)服務(wù)經(jīng)驗(yàn)
70 家

全國售后支持
1600+ 名

超千人的設(shè)計(jì)、研發(fā)團(tuán)隊(duì)
150 萬

服務(wù)企業(yè)客戶150萬家

Copyright ? 1999-2025 中企動(dòng)力科技股份有限公司（300.cn）版權(quán)所有京公網(wǎng)安備11030102010293號(hào) 京ICP證010249-2

在線咨詢

建站在線咨詢

獲取方案

獲取建站報(bào)價(jià)/方案

我們聯(lián)系您

微信咨詢

掃一掃添加
動(dòng)力姐姐微信

TOP

<address id="61666"></address>

<blockquote id="61666"></blockquote>

<abbr id="61666"><table id="61666"></table></abbr>

<address id="61666"><samp id="61666"></samp></address><abbr id="61666"><table id="61666"></table></abbr>

<address id="61666"><samp id="61666"><thead id="61666"></thead></samp></address>