Skip to main content

Matching data between Prolific and your external study software

When you run a study on Prolific, you'll usually download at least two datasets:

  • A demographic export from Prolific

  • Your survey response data from your external survey platform

Combining these datasets can make analysis easier by allowing you to view participant demographics alongside their survey responses in a single spreadsheet.

The exact steps may vary depending on the spreadsheet software you're using, such as Microsoft Excel, Google Sheets, or Apple Numbers.


Before you start

Open both spreadsheets side by side and locate the column containing the Prolific Participant ID in each file.

  • In the Prolific demographic export, the Participant ID is typically found in column B.

  • In your survey response export, the Participant ID column may vary depending on the platform.

Because the demographic export is usually the smaller dataset, we recommend adding the demographic information into your survey response spreadsheet.


Option 1: Use a formula (recommended)

Formulas allow you to automatically match participants across both spreadsheets using their Prolific Participant ID.

You can use functions such as:

  • XLOOKUP

  • INDEX-MATCH

  • VLOOKUP

This guide uses VLOOKUP as an example.

Step 1: Insert a VLOOKUP formula

In your survey response spreadsheet, select the cell where you want the demographic data to appear and enter a VLOOKUP formula.

The formula requires four pieces of information:

Lookup Value

The unique identifier used to match participants between spreadsheets.

In this case, use the Prolific Participant ID.

Table Array

The range of data from the demographic export that contains the information you want to import.

Important: The Participant ID column must be the leftmost column in the selected range.

Column Index Number

The position of the demographic column you want to return, counting from the Participant ID column.

For example:

  • Participant ID is in column A

  • Age is in column D

Since column D is the fourth column in the selected range, enter 4 as the column index number.

Range Lookup

This determines whether the match must be exact.

Enter FALSE to ensure only exact Participant ID matches are returned.

Step 2: Lock your references

To prevent the selected data range from changing when you copy the formula down the spreadsheet, lock the cell references using $ symbols.

In many spreadsheet applications, you can quickly do this by selecting the reference and pressing F4.

Step 3: Copy the formula down

Once you've confirmed the first formula returns the correct result:

  1. Select the cell containing the formula.

  2. Drag the fill handle down to apply the formula to all remaining rows.

Step 4: Repeat for additional columns

If you need to import multiple demographic fields, repeat the process for each column.

Step 5: Paste as values

After the formulas have returned the correct data:

  1. Copy the populated cells.

  2. Paste them back into the same location using Paste Special → Values (or the equivalent option in your spreadsheet software).

This converts the formulas into static data and prevents changes if the original spreadsheet is moved or edited later.


Option 2: Sort both spreadsheets by Participant ID

If using formulas isn't suitable, you can manually align the spreadsheets by sorting them.

Step 1: Sort both files

In each spreadsheet:

  1. Select the Participant ID column.

  2. Sort the data in ascending order.

Important: Make sure the entire spreadsheet is sorted, not just the Participant ID column. Sorting only one column will misalign your data.

Step 2: Verify the sort

Before copying any data, compare a few rows between both spreadsheets.

For example, check that the Participant ID in row 26 of one spreadsheet matches the Participant ID in row 26 of the other spreadsheet.

Spot-checking helps ensure the data has been sorted correctly.

Step 3: Copy the demographic data

Once both spreadsheets are aligned:

  1. Copy the demographic columns from the Prolific export.

  2. Paste them into the survey response spreadsheet.

We recommend pasting the data into new columns at the end of the spreadsheet so that no existing data is overwritten.


Which method should I use?

We recommend using a lookup formula whenever possible, as it:

  • Reduces the risk of human error

  • Automatically matches participants by ID

  • Works even when the datasets contain different numbers of rows

  • Makes it easier to update your data later

Sorting and manually copying data can be useful for smaller datasets, but requires extra care to ensure both spreadsheets remain perfectly aligned.

Did this answer your question?