Posts Tagged: nlp

spaCy – A fast natural language processing library

spaCy is a fairly new library (written in Python and Cython) for performing various NLP-related tasks such as tokenization, POS tagging and syntactic parsing. The authors claim it's faster (in some cases a lot faster) than other common solutions such as NLTK or Stanford's CoreNLP, and peer reviews seem to corroborate these claims. spaCy is English-only for now. If you're working on performance-intensive NLP tasks and that's no deal-breaker for you, you might want to check it out. The source code is available ... Read more

The Turing Exception by William Hertling

About a year ago I wrote about "Avogadro Corp" by William Hertling, the first book of his Singularity Series. A week ago I finished reading "The Turing Exception", the fourth and final instalment and a worthy close to the series. The Turing Exception follows common patterns of the series. Avogadro Corp was set in 2015, with each sequel taking place 10 years after the previous one. With The Turing Exception we've now arrived around 2045 and a world that in some respects still ... Read more

I don’t want an app for that

Scott Adams of Dilbert fame recently posted an article on why your phone interface is a legacy train wreck. He argues that the way we interact with our smartphones goes all the way back to the beginning of desktop computing. In spite of what Apple probably would have us believe - in spite of swiping, tapping and multitouch - we're still largely using our computing devices as if they were a 1987 IBM PC running Microsoft Word or Excel: When trying ... Read more

Deep Learning for NLP

Richard Socher, Chris Manning and Yoshua Bengio have created a tutorial on "Deep Learning for NLP (without Magic)". The tutorial includes slides and two videos of talks held on the subject. It deals with how deep learning algorithms can be applied in natural language processing. Deep learning is a set of algorithms and models which work under the assumption that observed data is generated from multiple layers of hidden representations that interact with each other. Although not really new and for some ... Read more

Hemingway: Readability Scores And Smart Suggestions On Style

Hemingway is an interesting new web app that not only assigns a readability score to your text but also makes smart suggestions on how to simplify it and make it more understandable. Having worked on readability algorithms before, I think this is a well-designed take on improving the usefulness of readability scores. Knowing that your text scores high or low in terms of readability only gets you so far. Hemingway additionally uses colour-coding to make suggestions such as: split ... Read more
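To give an idea of what a readability score looks like in practice, here is a minimal sketch of the classic Flesch Reading Ease formula. This is not necessarily the score Hemingway uses; the function names and the vowel-group syllable heuristic are assumptions made for illustration only.

```python
import re

def count_syllables(word):
    """Rough syllable estimate: count groups of consecutive vowels (heuristic)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))

def flesch_reading_ease(text):
    """Flesch Reading Ease: higher scores mean easier text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word

# Two sample texts: short plain words versus long abstract ones.
simple = "The cat sat on the mat. It was warm."
dense = ("Institutional considerations necessitate comprehensive "
         "organizational restructuring initiatives.")

print(flesch_reading_ease(simple) > flesch_reading_ease(dense))
```

A single number like this says little about where a text is hard to read, which is why per-sentence suggestions are the more useful part.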

Closer than you think

I've recently finished reading Avogadro Corp by William Hertling and the sequel A.I. Apocalypse. These books deal with the idea of how artificial intelligence might come about today or in the near future. The story's main premise is the eponymous Avogadro Corp, a thinly disguised Google. This company, whose name is conveniently related to a large number as well, offers a wide range of Internet services: search, a web-based office suite, web-based email (AvoMail ...) and its own smartphone OS (AvoOS). Sounds familiar? The story ... Read more

Natural Language User Interfaces And Internet Search

Recently, there was an article at Wired about IBM’s Watson and how IBM might be able to supersede Google as the dominant search engine by providing a question-answering kind of search engine. Every few years the idea of a natural language / semantic / question answering search engine crops up again. Indeed, natural language understanding is quite relevant for the crawling and indexing part of information retrieval systems and Google is very good at that. Just look at their quite formidable automatic translation ... Read more

Named Entity Recognition: Tools And APIs

Named entity recognition is a subtask of information extraction. It deals with extracting the names of persons, organizations and locations, as well as expressions of time, quantities, monetary values, percentages etc. from unstructured or semi-structured data such as emails or web pages. These are a few useful tools and APIs that provide named entity recognition functionality: AlchemyAPI: REST API that provides a number of natural language processing (NLP) and information extraction features; DBpedia Spotlight: automatically links DBpedia resources; OpenCalais: NLP API / web service by Thomson Reuters. Read more
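As a rough illustration of the task itself (not of how any of the tools above work), the following sketch pulls a few of the more regular entity types, namely monetary values, percentages and times, out of plain text with hand-written regular expressions. The patterns, labels and the `extract_entities` function are hypothetical; real NER systems rely on statistical models and knowledge bases, especially for persons, organizations and locations.

```python
import re

# Hypothetical, hand-written patterns for illustration only.
PATTERNS = {
    "MONEY": re.compile(r"[$€£]\s?\d+(?:[.,]\d+)*"),
    "PERCENT": re.compile(r"\d+(?:\.\d+)?\s?%"),
    "TIME": re.compile(r"\b\d{1,2}:\d{2}\b"),
}

def extract_entities(text):
    """Return (label, matched_text, start_offset) tuples sorted by position."""
    found = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            found.append((label, match.group(), match.start()))
    return sorted(found, key=lambda item: item[2])

sample = "The meeting at 14:30 approved a $1,200.50 budget, up 15% from last year."
entities = extract_entities(sample)

for label, value, offset in entities:
    print(label, value, offset)
```

Regular expressions only get you so far: they cannot tell whether "Washington" is a person, a city or a state, which is where dedicated tools and APIs come in.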

Topicalizer – an information extraction suite – now open source

Topicalizer is a suite of text analysis and information extraction tools that I developed. It used to be available at http://www.topicalizer.com. Unfortunately, I no longer have the time to maintain it properly, which is why I'm open-sourcing the code for others to learn from and build upon: https://github.com/BjoernKW/Topicalizer Read more
