Theme

HU2 - Applicable Experience

Description

Participants in the experiment clearly had more previous experience with Python which will be an influence on how well they understood the data pipeline code. In contrast participants had no experience with Jayvee and its language design.

Python is one of the most commonly used general-purpose programming languages (GPLs) in data engineering alongside with external libraries like Pandas. As a result, anyone working with data will also be more exposed to data pipelines written in Python.

However, learning from this previous experience can be challenging because implementations differ due to low structure and the use of other external libraries (see also HU2.2 - No learning effect between pipelines due to low structure, PL4.2 - External libraries have different approaches and change).

Familiarity with Python and its ecosystem is also relevant for navigating documentation which in turn lowers the time needed to look up information (EX1.2 - Quality of documentation of Jayvee and Python).

Lastly, being exposed to programming by using Python or similar GPLs can create a preference for their syntax and mean readers are used to common programming approaches like object-oriented programming (see also PL5.2 - Unfamiliar syntax makes understanding difficult and PL6.3 - Jayvee language elements (blocks, value types, constraints) are unusual and need to be learned).

Representative Quotes

I have worked with Python so I was aware what was going on in the code and nothing looked new to me. This was the main reason for finding it easier.

  • S11

It is the most common tool that used in the area and also the one that is used most frequently, therefore I have already spent significant time with its syntax.

  • S17

As a programming language I only know Python so I always prefer its syntax.

  • S43

Gone through Python documentation many times from bachelors that is why it seems easy.

  • S45

We have been practicing Python and Pandas for a long time, so we are habituated with the structure and data pipeline making it more easier.

  • S51