Skip to main content

Methodology

The surveyā€‹

The survey was fielded form February 1st to February 29th, 2024. It collected 517 entries.

We estimated that the survey should take between 5 to 10 minutes to complete. The form was first designed in English then we made sure to translate it to Arabic and French.

We used Google Forms to capture answers to have a user experience that our participants are used to, and because it was a free and easy option to go for.

The survey was anonymous, we didn't collect any personal identifiable information, and we didn't use any analytics tools on the forms, or on the landing page.

Confidentiality & data usageā€‹

Participation in the survey was entirely voluntary. All responses were kept confidential. No personal identifiable information was collected or disclosed in any reports or publications.

All data was analyzed and reported in aggregate, ensuring that individual responses cannot be traced back to specific participants.

To ensure compliance with Algerian regulations, we went through the law NĀ°18-07, and we requested a confirmation from ANPDP that we are not collecting any personally identifiable information.

Data analysisā€‹

In the first phase of our data analysis we mapped data from different forms with our scripts. Then we cleaned up our data from fake entries.

To determine "fake entries" we cross checked some inputs, compared salaries to role and experience levels and checked free-text answers.

Data cleaning was done in serval phases:

1. Cleaning pass 1: Manually flagging entriesā€‹

We ran a manual scan (yes, we went through 500+ lines of Excel) and flagged entries with 3 colors:

  1. šŸŸ© Green: Entries that seems coherent e.g. age, years of experience, job title/level and salary are reasonable.
  2. šŸŸ§ Orange: Entries that seem "odd", an exaggerated salary, or a high title for few years of experience, a long list of selected "used" technologies, etc.
  3. šŸŸ„ Red: Entries that have conflicting data and it's clear that it's fake (for example we had an entry from a student who has 2 years of experience and who claimed that their salary is 2000000000 euros per year), or duplicated.

While doing that, we redacted company names and swear words in some entries (some company names were mentioned in free text fields in "challenges" question).

After flagging all entries, we did a second pass on the dataset to cross-check the flags before we export the "clean" dataset.

2. Cleaning pass 2: Creating data clusters and find anomaliesā€‹

In this phase we clustered data to identify abnormal entries that we might missed, or mis-flagged in the first phase.

2. Cleaning pass 3: Interviewing experts and actors in Algerian tech industryā€‹

Since data is unavailable or outdated, and our team is not working in the Algerian market, It was only fair to run our findings through experts and personas who are working day-to-day in the Algerian tech industry.

The experts and personas are defined in our insights page.

Interviewsā€‹

Research & Crawlingā€‹