WebDriverException:消息:服務(wù)/ content / chromedriver意外退出狀態(tài)代碼為:-6(使用ChromeDriverGoogle Colab和Selenium)
我已經(jīng)找到了有關(guān)為什么我出錯的問題的答案。請安裝chrome-chromedriver,并將其添加到您的路徑變量以及bin目錄中。
這是解決如何在Colab上使用Selenium抓取數(shù)據(jù)的完整解決方案。使用PhantomJS還有另一種方法,但是Selenium已棄用此API,希望他們在下一次Selenium更新中將其刪除。
# install chromium, its driver, and selenium!apt-get update!apt install chromium-chromedriver!cp /usr/lib/chromium-browser/chromedriver /usr/bin!pip install selenium# set options to be headless, ..from selenium import webdriveroptions = webdriver.ChromeOptions()options.add_argument(’--headless’)options.add_argument(’--no-sandBox’)options.add_argument(’--disable-dev-shm-usage’)# open it, go to a website, and get resultswd = webdriver.Chrome(’chromedriver’,options=options)wd.get('https://www.website.com')print(wd.page_source) # results
這對于想要在Google Colab上而不是在您的本地計(jì)算機(jī)上抓取數(shù)據(jù)的任何人都有效。請按相同順序依次執(zhí)行以下步驟。
您可以在https://colab.research.google.com/drive/1GFJKhpOju_WLAgiVPCzCGTBVGMkyAjtk中找到筆記本。
解決方法我試圖使用Selenium運(yùn)行無頭Chrome瀏覽器以從網(wǎng)絡(luò)上抓取內(nèi)容。我使用wget安裝了無頭Chrome,然后將其解壓縮到當(dāng)前文件夾中。
!wget 'http://chromedriver.storage.googleapis.com/2.25/chromedriver_linux64.zip'!unzip chromedriver_linux64.zip
現(xiàn)在,當(dāng)我加載驅(qū)動程序時
from selenium.webdriver.chrome.options import Optionsimport os# instantiate a chrome options object so you can set the size and headless preferencechrome_options = Options()chrome_options.add_argument('--headless')chrome_options.add_argument('--window-size=1920x1080')chrome_driver = os.getcwd() +'/chromedriver'driver = webdriver.Chrome(chrome_options=chrome_options,executable_path=chrome_driver)
我收到一個錯誤
WebDriverExceptionTraceback (most recent call last)<ipython-input-67-0aeae0cfd891> in <module>()----> 1 driver = webdriver.Chrome(chrome_options=chrome_options,executable_path=chrome_driver) 2 driver.get('https://www.google.com') 3 lucky_button = driver.find_element_by_css_selector('[name=btnI]') 4 lucky_button.click() 5 /usr/local/lib/python3.6/dist-packages/selenium/webdriver/chrome/webdriver.py in __init__(self,executable_path,port,chrome_options,service_args,desired_capabilities,service_log_path) 60 service_args=service_args,61 log_path=service_log_path)---> 62 self.service.start() 63 64 try: /usr/local/lib/python3.6/dist-packages/selenium/webdriver/common/service.py in start(self) 84 count = 0 85 while True: ---> 86 self.assert_process_still_running() 87 if self.is_connectable(): 88 break /usr/local/lib/python3.6/dist-packages/selenium/webdriver/common/service.py in assert_process_still_running(self) 97 raise WebDriverException( 98 ’Service %s unexpectedly exited. Status code was: %s’ ---> 99 % (self.path,return_code)100 )101WebDriverException: Message: Service /content/chromedriver unexpectedly exited. Status code was: -6更新資料
因此,經(jīng)過一些研究,我嘗試了另一種方法
!apt install chromium-chromedriverimport selenium as seoptions = se.webdriver.ChromeOptions()options.add_argument(’headless’)driver = se.webdriver.Chrome(chrome_options=options)
在Google Colab上,這再次給了我相同的錯誤
WebDriverException: Message: Service chromedriver unexpectedly exited. Status code was: -6
相關(guān)文章:
1. docker images顯示的鏡像過多,狗眼被亮瞎了,怎么辦?2. android - 百度地圖加載完成監(jiān)聽3. 關(guān)于docker下的nginx壓力測試4. nignx - docker內(nèi)nginx 80端口被占用5. java - 阿里的開發(fā)手冊中為什么禁用map來作為查詢的接受類?6. macos - mac下docker如何設(shè)置代理7. python3.x - git bash如何運(yùn)行.bat文件?8. dockerfile - [docker build image失敗- npm install]9. golang - 用IDE看docker源碼時的小問題10. docker api 開發(fā)的端口怎么獲取?
