{"id":36889,"date":"2024-04-10T11:00:10","date_gmt":"2024-04-10T09:00:10","guid":{"rendered":"https:\/\/www.iab-forum.de\/?p=36889"},"modified":"2024-05-18T17:58:59","modified_gmt":"2024-05-18T15:58:59","slug":"the-research-data-centre-provides-a-good-example-for-other-data-areas","status":"publish","type":"post","link":"https:\/\/iab-forum.de\/en\/the-research-data-centre-provides-a-good-example-for-other-data-areas\/","title":{"rendered":"&#8220;The Research Data Centre provides a good example for other data areas&#8221;"},"content":{"rendered":"<p class=\"teaser\">The FDZ is now celebrating an anniversary: for twenty years, the Research Data Centre (FDZ) of the German Federal Employment Agency (BA) at the Institute for Employment Research (IAB) has provided external researchers with access to microdata for the purpose of non-commercial research. IAB-Forum took this as an opportunity to talk to two women who have provided direction and support for the work of the FDZ for many years: Dana M\u00fcller, Head of the FDZ, and Monika Jungbauer-Gans, Chair of the German Data Forum (RatSWD).<!--more--><\/p>\n<p class=\"question\">Ms M\u00fcller, first of all congratulations on the anniversary! To start with, we\u2019d like to ask you what you feel is special about the FDZ?<\/p>\n<div id=\"attachment_36890\" style=\"width: 189px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-36890\" class=\"wp-image-36890 size-full img-responsive\" src=\"https:\/\/iab-forum.de\/wp-content\/uploads\/2024\/04\/Forum_Portrait_Mueller-1.jpg\" alt=\"This is a portrait picture of Dana M\u00fcller.\" width=\"179\" height=\"219\" \/><p id=\"caption-attachment-36890\" class=\"wp-caption-text\"><a href=\"https:\/\/iab.de\/en\/employee\/m%C3%BCller-dana\/\">Dana M\u00fcller <\/a>has been head of the Research Data Center (FDZ) of the Federal Employment Agency at the IAB since October 2016.<\/p><\/div>\n<p>Dana M\u00fcller: The name \u201c<a href=\"https:\/\/fdz.iab.de\/en\/startseite-en\/\" target=\"_blank\" rel=\"noopener\">Research Data Centre<\/a>\u201d actually says it all \u2013 what\u2019s special about the FDZ is our data. It\u2019s truly a unique resource for labour market and occupational research. The law states that this data is to be made available for research, and it\u2019s our job to make this happen.<\/p>\n<p>We create standardised data products from administrative data provided by the Federal Employment Agency. This enables researchers worldwide to answer their research questions. My colleague in Data Management always remarks that what we do at the FDZ is to \u201crefine\u201d the data. We also provide access to IAB surveys and link them with administrative data to increase their analytic potential.<\/p>\n<p class=\"question\">What are the biggest challenges or milestones that the FDZ has faced over the last 20 years?<\/p>\n<p>M\u00fcller: 20 years sounds like a very long time \u2013 but for me it\u2019s actually gone by fairly quickly. The biggest challenge from my point of view was setting up the data infrastructure. We started out in 2005 with a single guest workstation in the office of one of my colleagues. This enabled external researchers to work with our data on-site for the first time. Over the years we\u2019ve created many additional ways to access the data. Safe rooms where researchers can access our data are now available in Germany, other European countries, the USA and Canada.<\/p>\n<p>Another challenge is the fact that our social data is subject to special protection. Data privacy is a very important aspect: that\u2019s why our projects can\u2019t be implemented overnight \u2013 they simply take time.<\/p>\n<p>One milestone that we have achieved is linking surveys to the administrative data. We started 20 years ago with the Linked Employer-Employee data sets, linking the responses of participating employers from the IAB Establishment Panel to administrative data of the people employed at those companies. For the first time, this enabled researchers to analyse the supply and demand side of the labour market at the same time.<\/p>\n<blockquote><p>Jungbauer-Gans: \u201cMy colleagues in Nuremberg would often come to the IAB to work on the data at a guest workstation.\u201d<\/p><\/blockquote>\n<div id=\"attachment_36892\" style=\"width: 189px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-36892\" class=\"wp-image-36892 size-full img-responsive\" src=\"https:\/\/iab-forum.de\/wp-content\/uploads\/2024\/04\/Forum_Portrait_Jungbauer-Gans-1.jpg\" alt=\"On the picture you can see a portrait of Monika Jungbauer-Gans\" width=\"179\" height=\"219\" \/><p id=\"caption-attachment-36892\" class=\"wp-caption-text\"><a href=\"https:\/\/www.dzhw.eu\/en\/gmbh\/mitarbeiter?m_id=661\">Prof. Monika Jungbauer-Gans<\/a> teaches at the Friedrich-Alexander University Erlangen-Nuremberg, heads the German Centre for Higher Education Research and Science Studies (DZHW) and is Chairwoman of the German Data Forum (RatSWD).<\/p><\/div>\n<p class=\"question\">Ms Jungbauer-Gans, how did you first come into contact with the BA\u2019s FDZ at the IAB?<\/p>\n<p>Monika Jungbauer-Gans: That actually does go back quite a long way. When I held a professorship at the University of Kiel I was in touch with the IAB\u2019s regional research group there. We were doing a joint analysis of diversity in companies as part of two projects funded by the <a href=\"https:\/\/www.dfg.de\/en\" target=\"_blank\" rel=\"noopener\">German Research Foundation (DFG<\/a>). For this purpose we were using precisely the IAB\u2019s linked data set that Dana M\u00fcller just mentioned. While these projects were running I moved to the University of Erlangen-Nuremberg, which of course fitted in perfectly. My colleagues in Nuremberg would often come to the IAB to work on the data at a guest workstation. That wouldn\u2019t have been so easy to do from Kiel at the time.<\/p>\n<p class=\"question\">So originally you were first and foremost an FDZ user?<\/p>\n<p>Jungbauer-Gans: Yes, exactly. Later, as a member of the IAB Scientific Advisory Council, I was even more aware of the work being done of course, and that included the Institute\u2019s work with data.<\/p>\n<p class=\"question\">You\u2019re also the Chair of <a href=\"https:\/\/www.konsortswd.de\/en\/about\/ratswd\/\" target=\"_blank\" rel=\"noopener\">German Data Forum (RatSWD)<\/a>, which is dedicated to improving access to high-quality data for research purposes. Where do you currently see a particular need to catch up in terms of data access, the interlinking of data records, and data infrastructure?<\/p>\n<p>Jungbauer-Gans: I see a major problem in the fact that in Germany it\u2019s not per se possible to link anonymised data in complicance with data privacy regulations, as is the case in Scandinavian countries or in the Netherlands. In Germany data is not viewed in cross-sectoral terms, it\u2019s seen from the point of view of individual policy areas. The German Social Code only regulates the protection of labour market data, for example. It would be very worthwhile to finally be able to link education data to labour market data, for instance.<\/p>\n<p class=\"question\">Is that possible in the Netherlands and in Scandinavia?<\/p>\n<p>Jungbauer-Gans: A lot of data can be linked in Scandinavia, including register data. Depending on the question, you can link labour market and health data, for example, or data on schools and teacher qualifications. Here in Germany, these data spaces are worlds apart.<\/p>\n<blockquote><p>Jungbauer-Gans: \u201cIn Germany, we have to trawl through all kinds of things to find out what data is available where.\u201d<\/p><\/blockquote>\n<p class=\"question\">Even though the mindset in the European Union is much more progressive, does the EU still tend to think more in terms of data spaces?<\/p>\n<p>Jungbauer-Gans: The European Union has in fact been a relatively strong pioneer. <a href=\"https:\/\/eur-lex.europa.eu\/EN\/legal-content\/summary\/european-data-governance.html\" target=\"_blank\" rel=\"noopener\">The Data Governance Act<\/a> says there should be a central point where you can check who has which data, for example. We\u2019re a long way away from that in Germany. Here we have to trawl through all kinds of things to find out what data is available where.<\/p>\n<p>But at EU level there\u2019s also a tendency to focus solely on certain particular data fields such as health, mobility and traffic. There\u2019s often a failure to look at the interlinking options here. As a researcher, you\u2019re always dealing with questions that are relevant to a whole range of areas. If I were a pharmaceutical company, health data would be enough for me, but if I want to analyse the impact of the coronavirus pandemic on commuter mobility, I need two data spaces.<\/p>\n<p class=\"question\">As one of 41 accredited data centres, the BA\u2019s FDZ is involved in the process that aims to ensure easier data access for research purposes in Germany. How do you rate the work done by our FDZ to date?<\/p>\n<p>Jungbauer-Gans: The BA\u2019s FDZ is one of the pioneers among the research data centres. It was one of the first to be founded after those of the statistical offices, so it did essential groundwork in terms of data privacy requirements, anonymisation of data and data access, for example. That was a truly pioneering achievement.<\/p>\n<p class=\"question\">How many new enquiries does the FDZ receive each year from researchers who have never worked with FDZ data before?<\/p>\n<p>M\u00fcller: We have around 200 new users every year. Our data is used by young researchers, too: after all, you can use our data to write a bachelor\u2019s thesis, a master\u2019s thesis or a doctoral dissertation. At the same time, our data is also used by a lot of \u201cold hands\u201d who provide policy advice, so they know exactly whether our data is particularly well-suited to answering a particular research question compared to other sources.<\/p>\n<blockquote><p>M\u00fcller: \u201cAbout half of our user projects have collaborators from academic institutions outside Germany.\u201d<\/p><\/blockquote>\n<p class=\"question\">Do you get many requests for FDZ data from abroad?<\/p>\n<p>M\u00fcller: By now, about half of our user projects have collaborators from academic institutions outside Germany. IAB data is used by well-known international researchers, too, such as Nobel Prize winner <a href=\"https:\/\/iab.de\/en\/fellows\/professor-david-card-ph-d\/\" target=\"_blank\" rel=\"noopener\">David Card<\/a>.<\/p>\n<p class=\"question\">How would a researcher based in a place like California access the FDZ\u2019s valuable data? Would she have to fly to Nuremberg?<\/p>\n<p>M\u00fcller: Not necessarily \u2013 although Nuremberg is always well worth a visit, of course. We have data access points worldwide; there has been one at the University of California in Berkeley since 2013, for instance. But it always depends on the question you\u2019re asking and how much data you need. Our data cannot simply be downloaded.<\/p>\n<p>Normally you have to start by filling out an application form online, and this asks you to meet certain requirements. Once we\u2019ve checked whether the research question can be addressed with our data, we have to anonymise the data for all data access points outside Europe in accordance with the General Data Privacy Regulation and conclude a special data use agreement with the institution where the researcher is employed. Providing there are no hiccups, the researcher will be able to work with the data within a period of around six weeks.<\/p>\n<blockquote><p>M\u00fcller: \u201cWe\u2019ve set our sights on improving access to confidential data in an international context \u2013 and that\u2019s no small undertaking.\u201d<\/p><\/blockquote>\n<p class=\"question\">Are there comparable research data centres in other countries which the FDZ cooperates with?<\/p>\n<p>M\u00fcller: In Europe, we\u2019re particularly involved with the International Data Access Network. This is a network of data providers who, like us, offer confidential data and are focused on enabling access to it across national borders.<\/p>\n<p>Here we collaborate with the <a href=\"https:\/\/www.cbs.nl\/en-gb\" target=\"_blank\" rel=\"noopener\">Central Agency for Statistics (CBS)<\/a> in the Netherlands, for example, which is the Dutch equivalent of Germany\u2019s Federal Statistical Office, if you like. Other institutions involved here are the French data providers <a href=\"https:\/\/www.casd.eu\/en\/\" target=\"_blank\" rel=\"noopener\">Secure Data Access Centre (CASD)<\/a> and also the <a href=\"https:\/\/ukdataservice.ac.uk\/\" target=\"_blank\" rel=\"noopener\">UK Data Service (UKDS)<\/a> and the <a href=\"https:\/\/www.gesis.org\/en\/home\" target=\"_blank\" rel=\"noopener\">Leibniz Institute for the Social Sciences (GESIS)<\/a>. We\u2019ve set our sights on improving access to confidential data in an international context \u2013 and that\u2019s no small undertaking.<\/p>\n<p class=\"question\">Can you give us a specific example of why such collaborations are important?<\/p>\n<p>M\u00fcller: If we\u2019re talking about data links, it would be nice to be able to pool the data of two countries so as to be able to undertake a national comparison of Germany and France, for example. At the moment, this type of analysis has to be carried out separately for each country. It would be much easier if you could analyse the data simultaneously. This is what we do in the International Data Access Network.<\/p>\n<p class=\"question\">The BA\u2019s FDZ also cooperates at national level with other research data centres in Germany. Why?<\/p>\n<p>M\u00fcller: Even though the research data centres in Germany are subject to different state laws \u2013 the Federal Statistical Office is covered by the Federal Statistics Act for example, while we\u2019re regulated by the German Social Code \u2013 we can still learn and benefit from each other. If the data protection officers at the Federal Statistical Office have previously worked on a topic or created working aids, for example, we can adapt these for our own purposes. There\u2019s no need to reinvent the wheel.<\/p>\n<p>We can also get together to create new data products. We\u2019re currently running a major collaborative project with the <a href=\"https:\/\/www.lifbi.de\/en-us\/\" target=\"_blank\" rel=\"noopener\">Leibniz Institute for Educational Trajectories\u2019 (LIfBi)<\/a> research data centre where we\u2019re interlinking the data from the National Educational Panel Study and making it available. Another example is a cooperation with the research data centre at the Centre for European Economic Research: here we\u2019ve linked administrative operating data which we\u2019ll soon be able to make available.<\/p>\n<blockquote><p>M\u00fcller: \u201cThe FDZ\u2019s own research is diverse, and it\u2019s incredibly important in terms of enabling us to provide good advice on the data.\u201d<\/p><\/blockquote>\n<p class=\"question\">The FDZ not only offers access to data and advice on its use, it also conducts research itself. Why?<\/p>\n<p>M\u00fcller: We do our own research because that\u2019s the best way to develop new data products: it enables us to take a closer look at data quality. If you set up research data products in a standardised way and you have a research project or a doctorate in progress at the same time, you delve deep into the data \u2013 and then you know how to group certain variables, for example. Or you find out that there are certain variables we shouldn\u2019t investigate further for reasons of data privacy or quality. The FDZ\u2019s own research is diverse, and it\u2019s incredibly important in terms of enabling us to provide good advice on the data.<\/p>\n<p class=\"question\">Big data has been the buzzword in recent years when it comes to collecting large amounts of data. Are these big volumes of data becoming increasingly important in research, too? And what is the role of the FDZ in this connection?<\/p>\n<p>Jungbauer-Gans: I see big data as an interesting opportunity to obtain data in areas that we haven\u2019t yet tapped into so well. However, the problem is that big data initially involves huge, unstructured and usually one-dimensional volumes of data. For example, loyalty card data shows how much a consumer buys, but reveals nothing else about them. It doesn\u2019t tell us about their regional mobility or personal background. A researcher would first have to find an interesting question with which to analyse this enormous, unstructured mass of individual numbers.<\/p>\n<blockquote><p>M\u00fcller: \u201cBig data is not the same as good data, and the effort it involves is enormous.\u201d<\/p><\/blockquote>\n<p>M\u00fcller: Exactly, which data should be used will primarily depend on the research project. Big data is not the same as good data, and the effort it involves is enormous. The IAB collected mobility data via respondents\u2019 smartphones with their consent, for example. That was millions of pieces of information. This data, was elaborately processed into mobility indicators, but it has its limitations \u2013 gaps in the data when a smartphone was switched off, for example.<\/p>\n<p class=\"question\">This is where machine learning might come into play. How do you assess the potential of artificial intelligence in terms of data processing in general and for the FDZ in particular?<\/p>\n<p>M\u00fcller: Our users already apply machine learning to our standardised data by searching for patterns, for example, and some researchers are experimenting with ChatGPT to help them write programme codes.<\/p>\n<p>In the future, artificial intelligence might be useful in the service sector, too \u2013 for advising users, for example. So, these are issues we can\u2019t avoid. But we need to keep a close eye on where developments are heading, and data privacy has to be taken into careful consideration, too.<\/p>\n<blockquote><p>Jungbauer-Gans: \u201cIt\u2019s important to me not to glorify artificial intelligence and to always take a close look at what\u2019s going on behind the scenes.\u201d<\/p><\/blockquote>\n<p>Jungbauer-Gans: It\u2019s important to me not to glorify artificial intelligence, but rather view it as a tool \u2013 and to always take a close look at what\u2019s going on behind the scenes. We need to understand and assess the algorithms before we incorporate artificial intelligence in our data work. How does it select data, and what does that mean in terms of the subject matter? Quite simply, we have to be careful not to draw the wrong conclusions.<\/p>\n<p class=\"question\">Ms M\u00fcller, looking ahead to the next 20 years: how would you like to see the FDZ develop going forward and where do you see the biggest challenge?<\/p>\n<p>M\u00fcller: First of all, I have high hopes for the Research Data Act. We urgently need a legal foundation in Germany that strengthens research and improves our ability to link data to answer research questions. I\u2019d like to see a link between our data and the data from the Family Benefits Office (Familienkasse), for example. Up to now, our administrative data has mainly enabled us to look at individuals. Human beings don\u2019t live in isolation, however, but often in partnerships and families. Information on the latter could be provided by the Family Benefits Office, and this would significantly advance our research.<\/p>\n<p class=\"question\">Ms Jungbauer-Gans, what recommendations do you have for the FDZ for the next 20 years?<\/p>\n<p>Jungbauer-Gans: Generally speaking, it\u2019s vital to remain open to innovation and to persevere in trying to improve the overall conditions in which we operate. But I don\u2019t think that\u2019s something we need to be telling the FDZ. Our colleagues are already doing this \u2013 and they provide a good example for other data areas.<\/p>\n<div class=\"dhsv-bs_callout dhsv-bs_callout-danger\"> <\/p>\n<h2 class=\"h3\">More about the FDZ<\/h2>\n<p>Located in the Institute for Employment Research (IAB), the Research Data Centre (FDZ) of the Federal Employment Agency (BA) was founded in 2004 with the aim of improving the exchange of information between research and statistics. Its mission is to make access to BA and IAB microdata transparent and standardised for the purpose of non-commercial empirical research.<\/p>\n<p>The FDZ not only provides access to data, it also helps researchers select, access and handle data. The FDZ\u2019s consulting services include analysis options, range and validity of the data. A total of 1,409 researchers made use of the FDZ\u2019s data products in the second half of 2023. The FDZ is not only a provider of data, however: it also conducts empirical research itself. Its focus here is on topics such as the linking of process and survey data, the internationalisation of data access, and content analyses relating to labour market research.<\/p>\n<p>The FDZ promotes cooperation with national and international research institutions. It now has 20 locations with data access points in seven countries. In addition to Germany, these are the USA, France, England, Luxemburg, Canada, Spain and Poland. <\/div>\n<p>Bild: beeboys\/stock.adobe.com;<\/p>\n<p>DOI: <a href=\"https:\/\/iab-forum.de\/en\/the-research-data-centre-provides-a-good-example-for-other-data-areas\">10.48720\/IAB.FOO.20240410.01<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The FDZ is now celebrating an anniversary: for twenty years, the Research Data Centre (FDZ) of the German Federal Employment Agency (BA) at the Institute for Employment Research (IAB) has provided external researchers with access to microdata for the purpose of non-commercial research. IAB-Forum took this as an opportunity to talk to two women who <a class=\"moretag\" href=\"https:\/\/iab-forum.de\/en\/the-research-data-centre-provides-a-good-example-for-other-data-areas\/\">read full article<\/a><\/p>\n","protected":false},"author":41,"featured_media":36910,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[311,193,1630,145],"tags":[1921,1922,1923,1920,1777,1562],"acf":[],"_links":{"self":[{"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/posts\/36889"}],"collection":[{"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/comments?post=36889"}],"version-history":[{"count":26,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/posts\/36889\/revisions"}],"predecessor-version":[{"id":37540,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/posts\/36889\/revisions\/37540"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/media\/36910"}],"wp:attachment":[{"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/media?parent=36889"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/categories?post=36889"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/iab-forum.de\/en\/wp-json\/wp\/v2\/tags?post=36889"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}