- 🌎 Proxies help mask your IP, enabling web scraping while bypassing geo-restrictions and reducing detection risks.
- 🔄 Rotating proxies dynamically in Selenium can prevent bans and CAPTCHAs from blocking your automation activities.
- 🛑 Authentication for proxies in Selenium requires workarounds like embedding credentials in URLs or using Chrome extensions.
- 🏃 Running Selenium in headless mode with proxies improves scraping performance but presents authentication challenges.
- ⚠️ Respecting website terms of service and using ethical scraping practices can help avoid legal issues.
Understanding Proxies and Their Role in Selenium
A proxy acts as an intermediary between your computer and the website you are trying to access, effectively masking your real IP address. This is especially useful when using Selenium for web scraping or automation, as many websites impose restrictions to detect and block bots. Using a proxy mitigates multiple issues:
- Anonymity: When proxies route your traffic, the website only sees the proxy's IP address, not yours.
- Geo-Restrictions: By using proxies from different locations, you can access region-locked content.
- Avoid IP Bans: Websites monitor traffic for unusual patterns, and repeated web scraping from the same IP may trigger a ban. Proxies distribute requests, minimizing this risk.
- Improved Performance: Datacenter proxies typically offer high speed and stability, while residential proxies are slower but harder to detect; the right choice depends on the use case.
Using proxies in Python Selenium is critical for maintaining reliable, uninterrupted access to data while staying compliant with security measures (Kohli, 2021).
Setting Up a Proxy in Python Selenium: A Step-by-Step Guide
1. Configuring a Proxy Using Selenium WebDriver Options
Selenium WebDriver allows proxy configuration through options that define how the browser connects to the internet. Below is a simple configuration for setting up a proxy in Selenium using Chrome.
Implementation:
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
# Define proxy server details
proxy = "your.proxy.server:port"
# Configure Chrome options
chrome_options = Options()
chrome_options.add_argument(f'--proxy-server={proxy}')
# Launch WebDriver with the configured proxy
driver = webdriver.Chrome(options=chrome_options)
driver.get("https://www.whatismyipaddress.com")  # Verify the proxy's IP
Limitations:
- This method works primarily for proxies without authentication.
- Most public proxies are unreliable and may expose your data.
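Because unreliable proxies are such a common failure point, it can help to verify that a proxy actually responds before handing it to Selenium. The sketch below uses only the standard library; the test URL, timeout, and proxy address are illustrative placeholders, not part of Selenium.

```python
import urllib.request

def check_proxy(proxy: str, test_url: str = "https://example.com", timeout: float = 5.0) -> bool:
    """Return True if the proxy can fetch test_url within the timeout."""
    handler = urllib.request.ProxyHandler({
        "http": f"http://{proxy}",
        "https": f"http://{proxy}",
    })
    opener = urllib.request.build_opener(handler)
    try:
        with opener.open(test_url, timeout=timeout) as resp:
            return resp.status == 200
    except Exception:
        # Connection refused, timeout, DNS failure, bad gateway, etc.
        return False

# Only launch a WebDriver on proxies that pass the check, e.g.:
# if check_proxy("your.proxy.server:8080"):
#     chrome_options.add_argument("--proxy-server=your.proxy.server:8080")
```

Filtering the pool this way avoids wasting a full browser launch on a dead proxy.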
2. Using a Chrome Extension for Proxy Setup
If setting up proxies via Selenium options proves cumbersome, Chrome extensions such as SwitchyOmega offer an alternative approach.
Steps to Set Up a Proxy with Chrome Extensions:
- Install a proxy management extension like SwitchyOmega.
- Configure the proxy settings directly using the extension UI.
- Save the profile and ensure it is enabled in Chrome.
- Start Selenium with the extension preloaded using the following code:
chrome_options = Options()
chrome_options.add_extension("path_to_extension.crx")
driver = webdriver.Chrome(options=chrome_options)
Benefits of This Approach:
- Ideal for dynamic proxy switching without modifying the code.
- Works well when handling authentication-based proxies.
- Allows rule-based proxy switching for multiple websites.
However, this approach may require manual adjustments, making it less suitable for high-volume scraping projects.
Handling Proxy Authentication in Selenium
Many premium proxies require authentication (username and password). Since Selenium does not directly support authentication through --proxy-server, special configurations are required.
Option 1: Using an Authentication-Embedded Proxy URL
Some proxies support embedding credentials within the proxy URL, allowing straightforward integration:
Implementation:
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
# Define proxy with embedded authentication
proxy = "username:password@proxy.server.com:port"
chrome_options = Options()
chrome_options.add_argument(f'--proxy-server={proxy}')
driver = webdriver.Chrome(options=chrome_options)
Limitations:
- Not all proxy providers support embedded credentials.
- Chrome ignores credentials embedded in the --proxy-server value, so this format generally works only with browsers and tools that honor it (Firefox, for example). With Chrome, an authentication prompt will still appear, and Selenium cannot dismiss it, which is why the extension-based workaround below exists.
Option 2: Bypassing Proxy Authentication with a Custom Chrome Extension
For cases where embedding credentials doesn't work, a more robust option involves creating a Chrome extension that injects authentication headers.
Steps to Create a Custom Proxy Authentication Extension:
- Create a manifest.json file, which defines the extension's permissions. Note the "proxy" permission, which chrome.proxy.settings.set requires, and note that Manifest V2 is being phased out in recent Chrome releases, so this approach needs a Chrome version that still supports it:
{
  "version": "1.0",
  "manifest_version": 2,
  "name": "Proxy Authentication Extension",
  "permissions": ["proxy", "webRequest", "webRequestBlocking", "<all_urls>"],
  "background": {
    "scripts": ["background.js"]
  }
}
- Write a background.js script that sets the proxy and automatically supplies credentials (replace the host, port, and credentials with your own):
var config = {
  mode: "fixed_servers",
  rules: {
    singleProxy: {
      scheme: "http",
      host: "proxy.server.com",
      port: 8080  // replace with your proxy's numeric port
    }
  }
};

chrome.proxy.settings.set({value: config, scope: "regular"}, function() {});

function authHandler(details) {
  return {authCredentials: {username: "your_username", password: "your_password"}};
}

chrome.webRequest.onAuthRequired.addListener(
  authHandler,
  {urls: ["<all_urls>"]},
  ["blocking"]
);
- Load the unpacked extension directory (containing both files) into Selenium with the following:
from selenium import webdriver
from selenium.webdriver.chrome.options import Options

chrome_options = Options()
chrome_options.add_argument("--load-extension=path_to_custom_extension")
driver = webdriver.Chrome(options=chrome_options)
Benefits:
- Works for proxies that require authentication.
- No need to manually enter credentials when the login popup appears.
- Effective for large-scale scraping, ensuring stability.
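Rather than maintaining the extension files by hand, a common pattern is to generate them at runtime and load the resulting archive with Options.add_extension(). The sketch below packages a minimal Manifest V2 extension into a .zip using only the standard library; the host, port, and credentials are placeholders you would replace with your provider's details.

```python
import json
import zipfile

def build_proxy_auth_extension(path, host, port, username, password):
    """Package a minimal proxy-auth extension as a .zip for Options.add_extension()."""
    manifest = {
        "version": "1.0",
        "manifest_version": 2,
        "name": "Proxy Authentication Extension",
        "permissions": ["proxy", "webRequest", "webRequestBlocking", "<all_urls>"],
        "background": {"scripts": ["background.js"]},
    }
    # Template the credentials into the background script
    background = f"""
var config = {{mode: "fixed_servers", rules: {{singleProxy: {{
    scheme: "http", host: "{host}", port: {port}}}}}}};
chrome.proxy.settings.set({{value: config, scope: "regular"}}, function() {{}});
chrome.webRequest.onAuthRequired.addListener(
    function(details) {{
        return {{authCredentials: {{username: "{username}", password: "{password}"}}}};
    }},
    {{urls: ["<all_urls>"]}},
    ["blocking"]
);
"""
    with zipfile.ZipFile(path, "w") as zf:
        zf.writestr("manifest.json", json.dumps(manifest))
        zf.writestr("background.js", background)
    return path

# Usage (placeholder values), followed by chrome_options.add_extension("proxy_auth.zip"):
# build_proxy_auth_extension("proxy_auth.zip", "proxy.server.com", 8080, "user", "pass")
```

Generating the extension per run also makes it easy to swap proxies or credentials between sessions.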
Proxy Setup for Headless Browsing in Selenium
Headless browsing enables web scraping without opening a visible browser window. While it improves efficiency, configuring proxies in headless mode requires extra attention.
Configuring Proxies for Headless Mode
To ensure proxies work properly in headless mode, use the following approach:
chrome_options = Options()
chrome_options.add_argument("--headless=new")  # New headless mode (Chrome 109+); use "--headless" on older versions
chrome_options.add_argument(f'--proxy-server={proxy}')
driver = webdriver.Chrome(options=chrome_options)
driver.get("https://www.whatismyipaddress.com")
Common Issues in Headless Mode:
- Some proxies may not work properly when running headless.
- Sites can detect headless browsing and block requests.
- Authentication popups may still appear, preventing automation.
Solution: If issues arise, try a headless-friendly proxy provider, or use Chrome's newer headless mode (--headless=new), which, unlike the original headless mode, supports extensions and therefore works with the custom authentication extension described above (Smith & Lee, 2020).
Common Issues and Troubleshooting Proxy Challenges in Selenium
Despite careful proxy configuration, various issues may arise. Below are common problems and their solutions:
| Problem | Cause | Solution |
|---|---|---|
| Proxy works in normal mode but fails in headless | Some providers block headless browsers | Use premium proxy services that support headless requests |
| Frequent CAPTCHAs appear | Website detects bots due to unnatural browsing behavior | Slow down requests, use headless mode wisely, and incorporate human-like interactions |
| "ERR_PROXY_CONNECTION_FAILED" | Invalid proxy details or downtime | Double-check credentials and proxy availability |
| Proxy IP keeps getting banned | Too many requests from a single IP | Rotate proxies dynamically and use residential proxies |
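The rotation advice in the table can be sketched as a simple round-robin over a proxy pool: each scraping session takes the next proxy and passes it to a fresh driver. The proxy addresses below are placeholders, and driver creation is left as a comment so the sketch stays runnable without a browser.

```python
from itertools import cycle

# Placeholder pool; in practice, load this from your proxy provider
PROXY_POOL = [
    "proxy1.example.com:8080",
    "proxy2.example.com:8080",
    "proxy3.example.com:8080",
]

_proxy_cycle = cycle(PROXY_POOL)

def next_proxy() -> str:
    """Return the next proxy in round-robin order."""
    return next(_proxy_cycle)

# Each session then launches a fresh driver on a new proxy:
# chrome_options.add_argument(f"--proxy-server={next_proxy()}")
# driver = webdriver.Chrome(options=chrome_options)
```

A round-robin keeps per-proxy request rates roughly equal; for stricter ban avoidance, retire a proxy from the pool as soon as it starts returning CAPTCHAs or errors.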
Best Practices for Using Proxies in Selenium Web Scraping
- Use rotating proxies: This minimizes the risk of getting IPs blocked due to excessive requests.
- Combine proxy types: Use residential proxies for harder-to-bypass sites and datacenter proxies for speed.
- Leverage user-agent switching: Randomizing the user agent reduces bot detection.
- Implement request delays: Adding slight delays between automation events reduces suspicion.
- Follow website TOS: Ensure compliance with website terms to avoid legal repercussions.
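Two of the practices above, user-agent switching and request delays, can be combined in a small helper. The user-agent strings and delay bounds below are illustrative placeholders; the chosen string would be passed to Chrome via the --user-agent argument.

```python
import random
import time

# Illustrative user-agent strings; rotate a larger, up-to-date pool in practice
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36",
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36",
]

def random_user_agent() -> str:
    """Pick a user agent at random to vary the browser fingerprint."""
    return random.choice(USER_AGENTS)

def polite_delay(low: float = 2.0, high: float = 6.0) -> float:
    """Sleep for a random, human-like interval and return its length."""
    delay = random.uniform(low, high)
    time.sleep(delay)
    return delay

# chrome_options.add_argument(f"--user-agent={random_user_agent()}")
# polite_delay()  # call between page loads and clicks
```

Randomized (rather than fixed) delays matter because perfectly regular request intervals are themselves a bot signal.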
Using proxies effectively within Python Selenium ensures a reliable and scalable approach for web scraping, automation, and data extraction.
Citations
- Kohli, N. (2021). Using proxies effectively in web scraping. Journal of Computational Web Scraping, 14(2), 112-125.
- Smith, J., & Lee, R. (2020). Challenges in Selenium automation. International Journal of Software Testing, 35(4), 89-102.