The three principles of tidy data are variables are in columns, observations are in rows, and values are in cells. These three rules are interrelated because it’s impossible to only satisfy two of the three. There is an advantage to having your data stored in a specific way, meaning it’s easier to use the built in functions for whatever program your’e using since they often encourage uniformity. These rules could prove very beneficial within my scholarly endeavors because it could keep the jumbled mess of information i can produce into a tidy, neat, and easily digestible for any reader that may come across it. With variables being laid across the top of the table this puts my data in a sort of categorical format. Meaning if someone was looking for multiple data values for a specific category of their interest, all they would need to do is find it along the top and bring their eyes down the screen to compile the values of said category. Which would make my data more digestible and easier for other people to use (after the proper citing of course.) The observations being organized into the rows is important because it now creates an intersection between two categories. Meaning someone could use the data from one category with or without using it’s relation to the other category. For example, Someone could look for the specific murders that occurred happened in Stockton. Murders being the variable while Stockton is the observation. Now say if someone wanted to know the amount of people who survived shootings in Stockton. The new variable being those who survived shootings in the same place being observed which is Stockton. When both a variable and observation intersect they create a value in an individual cell. This value is the answer someone is looking for when a question is posed. The values are organized according to where they intersect and this makes it easier to find the answer to a specific question. The easier it is for my data to digest the better it is to be used by others in their efforts of knowledge gathering. This type of organization has been adopted by scholars all over the world and because of this, we can all understand each others’ work. Scholars from Germany can compile a data table using the three tidy data rules and someone from America can understand the data presented with ease.

Leave a comment

Your email address will not be published.