- Actualité Sciences Po
Held by Cynthia PEDROJA, Project Manager, Data Management and Web Corpora projects and Nicolas SAUGER, Associate Professor at the Centre for European Studies and Comparative Politics (CEE), Director of the Center for Socio-Political Data (CDSP), Associate Researcher at the Laboratory for Interdisciplinary Evaluation of Public Policies (LIEPP)
If your PhD project involves working and even more collecting data, data management is key for success. This course offers a review of most pressing issues when dealing with data. Organization is key for efficient work. But data management goes way beyond. A number of demands are going to guide your use of data. Journals and funders request more and more often to make data accessible and data treatment available for review. Regulations organize what kind of regulation applies to research data, how it can be collected, stored and shared. But data is now an asset in its own right. It is a way to become cited and recognized. It is a way to achieve important research results.
This course is not a course about the methods of data collection or analysis. This course is proposed by Sciences Po’s library and the CDSP.
Cynthia Pedroja and Nicolas Sauger will be the principal instructors for this course, attending all sessions. A variety of instructors from the library and CDSP will participate in animating sessions over the course of this programme.
Cynthia Pedroja is project manager in Data management at the Library.
Nicolas Sauger is Professor of political science at Sciences Po. He is the director of the CDSP and has participated in large scale project of data collection such as the European Social Survey and the Comparative Study of Electoral Systems. He has a long experience with quantitative and experimental data.
Licensing and intellectual property, metadata and contextual description, obtaining informed consent for participation in research, anonymising research data for reuse, data storage, and data archiving. It introduces key contemporary concepts such as open science and FAIR principles. It also guides participants in the discovery of available tools for better data management.
Monday 27 May : 1 p.m - 6 p.m
Tuesday 28 May : 9 a.m - 12 p.m / 2 p.m - 4 p.m
This course is organized in 5 sessions in 2 days. Its combines lectures, presentations, exercises and discussions. Participants will have begun their data management plan for their thesis during this course.
Session 1 : introduction to data management and DMPs
This session introduces the importance of data management. Key focus are: open science and FAIR principles; the added value of managing efficiently data; data lifecycles. It will discuss the role of good data management in terms of scientific integrity and re-use of the data. It will also show how to access existing data. It will also cover the Data management plan principles and tools. It will introduce the main sections of DMP and how to fill them. These tools will serve as general guidelines for the rest of the course.
Session 2: organizing and cleaning data
This session will include a theoretical part followed by a practical one built around concrete case studies to experience good practices and pitfalls in terms of data management. We will discuss the issues of versioning, how to aggregate data, file formatting, data transformation, anonymisation, how to structure your data according to your research question, etc.
- Audrey Baneyx, Data manager (medialab)
- Selma Bendjaballah, Research engineer (CDSP)
- Simon Le Corgne, Data manager (CDSP)
Session 3: Security, regulations and ethics
This session raises the question of how to deal with data collection and data use in the right way. It will review requirements and policies, as set especially in GDPR. It will introduce the issue of licensing of data and intellectual property (“IP”) rights. The ethics of data collection and data storage will be also thoroughly covered, from informed consent to anonymisation. It will question data safety and integrity. It will finally consider the issue of the security of data and for instance introduce best practices in terms of encryption.
- Marion Lehmans - Data protection offficer
- Laurence Gagelin - Information Systems Security Officer (DSI - IT Department)
- Rémi Pignal Legal council - Intellectual property (DAJAM - The legal department)
Session 4: sharing, storing, and preserving research data
This session will have a closer look at how to preserve and distribute data. It will cover metadata and their role. It will also review the question of physical and digital preservation of data. It will set out the difference between preservation and long-term archiving. It will introduce key archival standards and trusted repositories. We wil do practical exercice to depose a dataset in the future Sciences Po Repository.
- Alina Danciu, Data manager (CDSP)
- Cynthia Pedroja, Project manager in research data management (Library)
Session 5 : wrap-up and conclusion
This session concludes the course and provides a final review of data management strategy of participants.
CVs of the trainers
Cynthia PEDROJA, Project Manager, Data Management and Web Corpora projects
Nicolas SAUGER, Associate Professor at the Centre for European Studies and Comparative Politics (CEE), Director of the Center for Socio-Political Data (CDSP), Associate Researcher at the Laboratory for Interdisciplinary Evaluation of Public Policies (LIEPP)
References and Sources
Henderson, M.E. 2016 Data management, New York, Rowman & Littlefield.
Plotkin, D. 2013 Data stewardship. Burlington, M. Kaufman.
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., ... & Bouwman, J. 2016 The FAIR Guiding Principles for scientific data management and stewardship. Scientific data, 3.
Queensland University of Technology. (2013). Management of Research data. Retrieved from http://www.mopp.qut.edu.au/D/D_02_08.jsp
EPSRC, Engineering and Physical Sciences Research Council. Scope. Retrieved from www.epsrc.ac.uk/about/standards/researchdata/scope/
Crossley, J.; Stewart, J. (2013). Managing research data. Retrieved from http://www.scribd.com/doc/138079216/Managing-Research-Data
University of Sheffield. (2013, September 2). Research data management: what is research data? [blog]. Retrieved from http://researchdatamanagement.blogspot.co.uk/2013/09/what-is-research-data.html?m=1
Scott, M. (2013). Introducing Research Data, University of Southampton. Retrieved from https://eprints.soton.ac.uk/403440/1/introducing_research_data.pdf
Burnham, A. (2013). An introduction to managing research data. University of Leicester. Retrieved from http://www2.le.ac.uk/services/research-data/documents/an-introduction-to-managing-research-data
University of Leicester. (2012). Research data: definitions. Retrieved from http://www2.le.ac.uk/services/research-data/documents/UoL_ReserchDataDefinitions_20120904.pdf
For more information or to register the course, please contact firstname.lastname@example.org.
E-mail : email@example.com