Data Science & Tech Blog

Tropical Geometry and Neural Networks

By Sam Shideler on Thu, Apr 11, 2019

Algebraic geometry is not a subject that often arises in conversations around data science and machine learning. However, recent work in the field of tropical geometry (a subset of algebraic geometry) suggests that this subject might be able give some insights into the types of functions representable by neural networks (as well as give some upper bounds on the complexity of functions representable by neural nets of fixed width and depth).

Tags: algebraic geometry

When (not) to Lemmatize or Remove Stop Words in Text Preprocessing

By Alex Schumacher on Thu, Mar 21, 2019

Natural language text is messy. It’s full of disfluencies (‘ums’ and ‘uhs’) or spelling mistakes or unexpected foreign text, among others. What’s worse, even when all of that mess is cleaned up, natural language text has structural aspects that are not ideal for many applications. Two of those challenges, inconsistency of form and contentless material are addressed by two common practices: lemmatization and stop word removal. These practices are effective countermeasures to their respective problems, but they are often taken as writ when in fact they should are application- and problem-specific. In this blog, I’ll be discussing lemmatization and stop word removal, why they’re done, when to use them, and when not to.

Tags: NLP, AI, robots

An introduction to survival analysis

By Shane Pederson on Thu, Feb 07, 2019

It’s one of mankind’s oldest questions - how long will it run? How long will I live? Predicting the length of life has occupied thinkers, scientists, and everybody else throughout history.

Tags: survival