Glossary term
Glossary term
Foundations
Features represented as integers or real-valued numbers. For example, a house valuation model would probably represent the size of a house (in square feet or square meters) as numerical data. Representing a feature as numerical data indicates that the feature's values have a mathematical relationship to the label. That is, the number of square meters in a house probably has some mathematical relationship to the value of the house.
Not all integer data should be represented as numerical data. For example, postal codes in some parts of the world are integers; however, integer postal codes shouldn't be represented as numerical data in models. That's because a postal code of 20000 is not twice (or half) as potent as a postal code of 10000. Furthermore, although different postal codes do correlate to different real estate values, we can't assume that real estate values at postal code 20000 are twice as valuable as real estate values at postal code 10000. Postal codes should be represented as categorical data instead.
Numerical features are sometimes called continuous features.
See Working with numerical data in Machine Learning Crash Course for more information.
Created for this library
A retail data team uses numerical data like price, inventory level, and discount percentage as inputs to its demand model.
A finance analytics team uses numerical data from the general ledger for forecasting working capital with a tabular ML model.
A healthcare analytics team uses numerical data from lab tests as inputs to its risk stratification model.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License