I now know how to extract text from a webpage and get a program to read it. How do I make the program search that text for the word sequences contained in the input, and then find and print the sequences most likely to occur with them?
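
One common way to approach this is an n-gram count: record which word follows each short run of words in the text, then look up the last words of the input and rank the followers by relative frequency. The sketch below is only an illustration under several assumptions: the page text is already a plain string, splitting on whitespace is good enough as tokenization, and "most probable" means highest observed frequency. The names `tokenize`, `build_model`, and `most_probable_next` are hypothetical, not from any library.

```python
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase and split on whitespace; punctuation is not handled.
    return text.lower().split()


def build_model(tokens, n=2):
    """Map each n-word context to a Counter of the words that follow it."""
    model = defaultdict(Counter)
    for i in range(len(tokens) - n):
        context = tuple(tokens[i:i + n])
        model[context][tokens[i + n]] += 1
    return model


def most_probable_next(model, query, n=2, top=3):
    """Return up to `top` (word, probability) pairs for the query's last n words."""
    context = tuple(tokenize(query)[-n:])
    followers = model.get(context)
    if not followers:
        return []  # this context never appeared in the text
    total = sum(followers.values())
    return [(word, count / total) for word, count in followers.most_common(top)]


# Stand-in for text extracted from a webpage.
corpus = "the cat sat on the mat and the cat sat on the sofa and the cat ran off"
model = build_model(tokenize(corpus), n=2)

for word, prob in most_probable_next(model, "the cat"):
    print(f"{word}: {prob:.2f}")
```

A larger `n` gives more specific matches but needs much more text before any context repeats, so in practice people often fall back to a shorter context when the longer one is not found.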