Covid-19 coronavirus Latest Datasets
A Free, Open Resource for the Global Research Community
In response to the COVID-19 pandemic, the Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 Open Research Dataset (CORD-19), a free resource of over 29,000 scholarly articles, including over 13,000 with full text, about COVID-19 and the coronavirus family of viruses for use by the global research community.
This dataset is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease. The corpus will be updated weekly as new research is published in peer-reviewed publications and archival services like bioRxiv, medRxiv, and others.
Covid-19 Coronavirus Challenges
In response to the COVID-19 pandemic, the White House and a coalition of leading research groups have prepared the COVID-19 Open Research Dataset (CORD-19). CORD-19 is a resource of over 29,000 scholarly articles, including over 13,000 with full text, about COVID-19, SARS-CoV-2, and related coronaviruses. This freely available dataset is provided to the global research community to apply recent advances in natural language processing and other AI techniques to generate new insights in support of the ongoing fight against this infectious disease. There is a growing urgency for these approaches because of the rapid acceleration in new coronavirus literature, making it difficult for the medical research community to keep up.
Call to Action
We are issuing a call to action to the world’s artificial intelligence experts to develop text and data mining tools that can help the medical community develop answers to high priority scientific questions. The CORD-19 dataset represents the most extensive machine-readable coronavirus literature collection available for data mining to date. This allows the worldwide AI research community the opportunity to apply text and data mining approaches to find answers to questions within, and connect insights across, this content in support of the ongoing COVID-19 response efforts worldwide. There is a growing urgency for these approaches because of the rapid increase in coronavirus literature, making it difficult for the medical community to keep up.
A list of our initial key questions can be found under the Tasks section of this dataset. These key scientific questions are drawn from the NASEM’s SCIED (National Academies of Sciences, Engineering, and Medicine’s Standing Committee on Emerging Infectious Diseases and 21st Century Health Threats) research topics and the World Health Organization’s R&D Blueprint for COVID-19.
Many of these questions are suitable for text mining, and we encourage researchers to develop text mining tools to provide insights on these questions.
Kaggle is sponsoring a $1,000 per task award to the winner whose submission is identified as best meeting the evaluation criteria. The winner may elect to receive this award as a charitable donation to COVID-19 relief/research efforts or as a monetary payment. More details on the prizes and timeline can be found on the discussion post.
Accessing the Dataset
We have made this dataset available on Kaggle, and are periodically updating it from its source. To learn more and access the latest copy of the dataset, you can also go here: https://pages.semanticscholar.org/coronavirus-research.
The licenses for each dataset can be found in the all _ sources _ metadata csv file.
- World Health Organization (WHO): https://www.who.int/
- DXY.cn. Pneumonia. 2020. http://3g.dxy.cn/newh5/view/pneumonia.
- BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
- National Health Commission of the People’s Republic of China (NHC):
- China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm
- Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html
- Macau Government: https://www.ssm.gov.mo/portal/
- Taiwan CDC: https://sites.google.com/cdc.gov.tw/2019ncov/taiwan?authuser=0
- US CDC: https://www.cdc.gov/coronavirus/2019-ncov/index.html
- Government of Canada: https://www.canada.ca/en/public-health/services/diseases/coronavirus.html
- Australia Government Department of Health: https://www.health.gov.au/news/coronavirus-update-at-a-glance
- European Centre for Disease Prevention and Control (ECDC): https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases
- Ministry of Health Singapore (MOH): https://www.moh.gov.sg/covid-19
- Italy Ministry of Health: http://www.salute.gov.it/nuovocoronavirus
coronavirus Latest research
real estimates of mortality following covid-19 infection the lancet infectious diseases march 17, 2020, david baud guillaume favre,
Clinical progression of patients with COVID-19 in Shanghai, China The Journal Of Infection March 17, 2020 Jun ChenHongzhou Lu
Back to the spring of Wuhan: facts and hope of COVID-19 outbreak Frontiers Of Medicine March 17, 2020 Guangbiao ZhouZhu Chen
New insights on the antiviral effects of chloroquine against coronavirus: what to expect for COVID-19?International Journal Of Antimicrobial AgentsMarch 17, 2020Christian A DevauxDidier Raoult
Clinical characteristics of refractory COVID-19 pneumonia in Wuhan, China Clinical Infectious Diseases : An Official Publication Of The Infectious Diseases Society Of America March 17, 2020 Pingzheng MoYongxi Zhang
Report on the Epidemiological Features of Coronavirus Disease 2019 (COVID-19) Outbreak in the Republic of Korea from January 19 to March 2, 2020 Journal Of Korean Medical Science March 17, 2020 Korean Society Of Infectious Diseases Korea Centers For Disease Control And Prevention
SARS-CoV-2 RNA more readily detected in induced sputum than in throat swabs of convalescent COVID-19 patients The Lancet Infectious Diseases March 17, 2020 Huanqin HanWeiqiang Zheng
COVID 19: Will the 2020 Hajj pilgrimage and Tokyo Olympic Games be cancelled? Travel Medicine And Infectious Disease March 17, 2020 Philippe Gautret Van Thuan Hoang
Genomic analysis of a 2019-nCoV strain in the first COVID-19 patient found in Hangzhou, Zhejiang, ChinaZhonghua Yu Fang Yi Xue Za Zhi [Chinese Journal Of Preventive Medicine]March 17, 2020H YuJ C Pan
Potential preanalytical and analytical vulnerabilities in the laboratory diagnosis of coronavirus disease 2019 (COVID-19)Clinical Chemistry And Laboratory Medicine : CCLMMarch 17, 2020Mario PlebaniMario Plebani
Clinical characteristics of severe acute respiratory syndrome coronavirus 2 reactivationThe Journal Of InfectionMarch 17, 2020Guangming YeXinghuan Wang
Skin damage and the risk of infection among healthcare workers managing coronavirus disease-2019Journal Of The American Academy Of DermatologyMarch 17, 2020Jiajia LanJuan Tao
A pathological report of three COVID-19 cases by minimally invasive autopsiesZhonghua Bing Li Xue Za Zhi Chinese Journal Of PathologyMarch 17, 2020X H YaoX W Bian
Care of haematology patients in a COVID-19 epidemicBritish Journal Of HaematologyMarch 17, 2020John WillanAndrew Peniket
The effectiveness of quarantine and isolation determine the trend of the COVID-19 epidemics in the final phase of the current outbreak in China International Journal Of Infectious Diseases : IJID : Official Publication Of The International Society For Infectious Diseases March 17, 2020 Biao TangJian hong Wu
Laboratory abnormalities in children with novel coronavirus disease 2019 Clinical Chemistry And Laboratory Medicine : CCLM March 17, 2020 Brandon Michael Henry Mario Plebani
RBD mutations from circulating SARS-CoV-2 strains enhance the structure stability and infectivity of the spike proteinBioRxiv March 17, 2020 Junxian OuQi wei Zhang
A systematic review on the efficacy and safety of chloroquine for the treatment of COVID-19 Journal Of Critical Care March 17, 2020 Andrea Cortegiani Sharon Einav
The SARS-CoV-2 outbreak: what we know International Journal Of Infectious Diseases : IJID : Official Publication Of The International Society For Infectious Diseases March 17, 2020 Di WuZhi cong Yang
Washing our hands of the problem The Journal Of Hospital Infection March 17, 2020 Chris Lynch Jim Gray