Pattern: A Bundle of Data Mining Modules for Python

Pattern is a collection of open-source (BSD-licensed) web mining modules for Python from the Computational Linguistics and Psycholinguistics Research Center. It contains tools for data retrieval, text analysis, and data visualization, and comes with over 30 sample scripts.

Pattern consists of six main modules:

  • pattern.web: A toolkit that includes APIs for various Web services, including Google, Gmail, Bing, Twitter, Wikipedia, and Flickr. It has its own HTML parser and Web spider.
  • pattern.table: A module for working with tabular data, used for storing data from the pattern.web module.
  • pattern.en: A natural language processing toolkit for English.
  • pattern.search: A module containing a search algorithm.
  • pattern.vector: A module containing various tools for analyzing the text of a document.
  • pattern.graph: A module for data visualization using Canvas.
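
To give a feel for the kind of document analysis the pattern.vector entry describes, here is a minimal sketch using only the Python standard library. It does not use Pattern's own API. The function names word_counts and cosine_similarity are hypothetical, chosen for illustration, and the bag-of-words approach is an assumed stand-in for whatever the module actually implements.

```python
import math
import re
from collections import Counter


def word_counts(text):
    """Lowercase the text and count each word (a simple bag-of-words model)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two word-count vectors, from 0.0 to 1.0."""
    # Only words present in both documents contribute to the dot product.
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(n * n for n in a.values()))
    norm_b = math.sqrt(sum(n * n for n in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical sample documents, standing in for text retrieved from the Web.
doc1 = word_counts("Pattern is a web mining module for Python.")
doc2 = word_counts("Python has many modules for mining the web.")
print(cosine_similarity(doc1, doc2))
```

Comparing word-count vectors this way is one common starting point for ranking or clustering retrieved documents; a real toolkit would typically add weighting and stemming on top.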
