Read the entire contents.
Converted it into String.
Removed the HTML Tags and the Whitespace characters
Filtered out only valid words without special characters ( only hypen permited )
Once all the filtering process was done, General words like "the", "a" etc was removed
After that, the string was converted into Array.
Using the callback of the following function, I calculated the Frequency of the words.
[array_count_values() returns an array using the values of the input array as keys and their frequency in input as values. ]
Then The Top 15 words were selected, converted into string and printed out.
They were embedded in BOLD characters automatically!
Now the **** thing that happened was, All the spelling mistakes and things like LOL and OK and hundreds of diffrent words started comming up at the top 20. Any suggestions in which I can make only top valid english meaningful words as the keyword.

