The previous article on "Trusted Data", covered the definition on the term, but the article did not provide any leads yet on how to act on this. How do you work with data and ensure the data is trustworthy? Table of Contents We will now cover those questions, and we begin with some key areas on how you can collect and process data. This could be used as a check list to make data more trustworthy. The quantity of information available is so staggering that we cannot know everything about a subject. For example, it's estimated that anyone attempting to research what's known about depression would have to read over 100,000 studies on the subject. And there's the problem of trying to decide which studies have produced reliable results. If you have used the source before, you have probably acquired a level of trust with that source, relying on the data it provides. If you have not used the source, you can verify it by enquiring experience from other users of that source and validate the data itself. To recap on what sources could be: The interpretation method will vary from each use and scenario. The perspective used to interpret those data is strongly biased with our own understanding of the data. We could also say that our take on the data can be based on how we have interpreted the data before. Try to analyse the data from another view. If it is statistical data gathered from a quantitative survey, could the data be strengthened or weakened through a qualitative survey? If you monitor a server, how can you be sure that the server is healthy without actually "physically" checking the server? Do you rely on the data from the monitoring completely? When it comes to the interpretation of data, developers most likely will interact with a term called "false positives". We can quickly explain this by this example in JavaScript: The difference between Another example is this: This is because the created objects got different object references. One example on false positives that could have resulted in World War 3, is the 1979 NORAD NUCFLASH incident: The missile defense agency received alarming indications that a full scale battery of Russian nukes were heading toward the U.S. Planes were scrambled with fully armed nuclear weapons. The president's shielded emergency plane was put into the air too ... Fingers hovered over buttons. Commanders of flight crews waited for word to strike. For six tense minutes, no one was sure if World War III was happening …and oddly, no one used the "red phone" hotline to ask the Soviets. Finally, word came from Advanced Early Warning radar and satellites that no missiles were detected. The culprit? A training tape had accidentally been run and generated the false positive signals. Verify the data by checking against other sources. Re-collect it, is the data the same? Can the data be reproduced over time? Within IT-development, this could be ensured during a development lifecycle with unit-tests, data validation, checking charsets and protocols. The best way to find out if you can trust somebody is to trust them. In the end, you will have to have a whole lot of trust in yourself to be able to collect, analyse, interpret and relay data that can be trusted. You cannot trust data 100%, but you can put some effort into validating data. Check other reliable sources, be skeptic, question the validity of data. Re-collect information, be certain you have the correct information.Sources
Interpretation
False positives
==
and ===
is that the latter means "equality wihout type coersion", meaning that when we use ===
, we also check if the type is equal.Data
A final word