Tuesday, January 3, 2017

Data Analysis (percentages) with Python

This is a great example of how encryption experts and homeland security 'spies' can analyze large quantities of data and flag elements that might be suspicious or important. 

This Python module allows you to compare a given sentence, paragraph, or phrase and compare it to a 350,000 word English dictionary to determine whether the sentence is written in English.  It simply asks the question, "What percentage of the words in the given sentence were in the dictionary file?"  It then makes a decision based on that percentage.

The students may not be up for coding this, but a teacher could easily walk students through a sample sentence to determine what percentage of the words were in the dictionary file.

No comments:

Post a Comment