cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
Highlighted
Helper I
Helper I

How to use web link from Kaggle to extract a .csv file using Web extractor?

Hello dear experts,

 

I'm trying to extract covid-19 real-time dataset from this below URL. But unfortunately, I'm not seeing a table in my Power BI - extract data pane, rather I see a .html input. Why is this and how to resolve it?

 

https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset?select=covid_19_data.csv

 

Kind regards,

Ambareesh

1 ACCEPTED SOLUTION

Accepted Solutions
Highlighted

You have a few ways of doing this (as far as I'm aware) you can retrieve the download using something like a PHP/python script and extract the .csv to a location where you can subsequently have powerBI look for it. This I think is the only way other than downloading it yourself or by searching for a stream of data. I do see that he in Kaggle references to a source, from which he retrieves and edits the data. Perhaps looking at the source gives you a link that you can in fact use in powerBI

View solution in original post

8 REPLIES 8
Highlighted
Helper I
Helper I

Hi,

 

This is because that URL in fact is a link to a webpage and not directly to a .csv  I cannot find a link directly to the CSV file on Kaggle and the download link on the page that your link currently goes to only retrieves a .zip.

 

 

Highlighted

Ok @dzuurman , now I understood why this happened. 

 

But how do I resolve this? How can I use this real-time data in my report?

 

Thanks,

Ambareesh.

Highlighted

You have a few ways of doing this (as far as I'm aware) you can retrieve the download using something like a PHP/python script and extract the .csv to a location where you can subsequently have powerBI look for it. This I think is the only way other than downloading it yourself or by searching for a stream of data. I do see that he in Kaggle references to a source, from which he retrieves and edits the data. Perhaps looking at the source gives you a link that you can in fact use in powerBI

View solution in original post

Highlighted
Resident Rockstar
Resident Rockstar

@jonnaamb 

 

It seems that you are trying to use COVID-19 datasets. Kaggle uses Johns Hopkins COVID-19 datasets (based on the content from web link that you sent). It is the best source of information for updated data.

The source of the information is this one:

https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series

 

You can connect to the dataset by using the following statement in Power Query Editor:

 

 

= Csv.Document(Web.Contents("https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_recovered_global.csv"),[Delimiter=",", Encoding=65001, QuoteStyle=QuoteStyle.None])

 

 

 

Or else you can select 'New Source' --> 'Web' --> add the link above (https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_ti...)

Highlighted
Helper I
Helper I

Alternatively, I managed to use Google Docs source directly which is present in the same webpage.

 

https://docs.google.com/spreadsheets/d/e/2PACX-1vQU0SIALScXx8VXDX7yKNKWWPKE1YjFlWc6VTEVSN45CklWWf-uW...

 

Thanks,

Ambareesh

Highlighted

Do test to see if this works after publishing your report, I do believe that this only works when logged into google docs. Im not convinced this will work after publishing.

Highlighted

@dzuurman , I used Anonymous login and it still worked.

 

Thanks, 

Ambareesh

Highlighted

Thats great, good to know ! Glad i could be of help, and do share your covid-19 dashboard/report when its done !

Helpful resources

Announcements
Community Conference

Power Platform Community Conference

Check out the on demand sessions that are available now!

Community Conference

Microsoft Power Platform Communities

Check out the Winners!

secondImage

Create an end-to-end data and analytics solution

Learn how Power BI works with the latest Azure data and analytics innovations at the digital event with Microsoft CEO Satya Nadella.

Top Solution Authors
Top Kudoed Authors