Saturday, 13 August 2022

Day 38 : Scrape Table from a Website using Python

 


import urllib.request

import pandas as pd


#List of publicly listed ITES companies of India

url = "https://en.wikipedia.org/wiki/List_of_publicly_listed_ITES_companies_of_India"


with urllib.request.urlopen(url) as i:

    html = i.read()

    

data = pd.read_html(html)[0]

print(data.head())


#clcoding.com

               Company       Listed  Founded            Revenue  \
0  3i Infotech Limited  BSE: 532628     1993     US$239 million   
1     HCL Technologies  BSE: 532281     1976    US$5.36 billion   
2              Infosys  BSE: 500209     1981    US$8.24 billion   
3    KPIT Technologies  BSE: 532400     1990  US$444.32 million   
4               Mastek  BSE: 523704     1982  US$150.43 million   

                 Profit Headcount Reference  
0  US$84 million (2014)      9000       [1]  
1                   NaN    197777       [2]  
2       US$1.75 billion    169638       [3]  
3                   NaN     10291       [4]  
4       US$8.37 million      3352       [5]  

0 Comments:

Post a Comment

Codecademy Code Foundations

Popular Posts

Categories

Android (23) AngularJS (1) Assembly Language (2) Books (10) C (75) C# (12) C++ (81) Course (1) Data Strucures (4) Downloads (1) Engineering (13) flutter (1) FPL (17) Hadoop (1) HTML&CSS (40) IS (25) Java (89) Leet Code (4) Pandas (2) PHP (20) Projects (19) Python (434) R (69) Selenium Webdriver (2) Software (14) SQL (27)