DAO23: Data Architecture for Machine Learning

**This is Subscription-Only Content, It is NOT purchasable as a separate product**

Data Architecture for Machine Learning

Machine learning is at the cutting edge of modern data use cases. After years of slow and painstaking advances, machine learning is experiencing rapid adoption today. Machine learning uses statistical methods to make predictions and to automatically improve prediction accuracy over time. In business it is readily applied to common analytics problems such as quantitative investing, customer recommendations, medical diagnosis, predictive maintenance, fraud prevention, and more. And today’s hottest technologies – generative AI – are built on a machine learning foundation. We are in the early stages of a steep adoption curve for machine learning.

To prepare for machine learning, it is important to focus on Data Architecture. You don’t need (and don’t want) a separate Data Architecture for machine learning. Instead, think about how to extend your existing Data Architecture. What new Data Management capabilities do you need? What new architectural features and functions should be intergrated into your Data Architecture?

A Google data scientist once said that simple models working with very large datasets are more accurate than complex models with small amounts of data. So, data volume is certainly a consideration, but it is only the beginning. Join this session to learn about these Data Architecture considerations for machine learning:

Data Architecture for massive data volumes

Real-time data pipelines

A single database for transactions and analytics

Redeploying batch models in real time

Data preparation for supervised learning

Data preparation for unsupervised learning

Write-back Data Management (machine learning creates new data)

 

Speaker: Dave Wells

Consultant and Educator, Infocentric

Dave Wells has more than 50 years of experience working with data and technology in many roles including application development, data architecture, data governance, IT management, business management, technology research, and more. Dave also has more than 30 years of experience as an information management and information technology educator including curriculum management, instructional design, course development, and teaching in live classroom, virtual, and e-learning formats. See Dave’s blog at eckerson.com to read his thoughts about a variety of data and information management topics.

Subscription Purchase Options

Become a DATAVERSITY Insider when you subscribe and gain access to a host of special content.

Share This

Whats Included


Access your courses anytime, anywhere, with a computer, tablet or smartphone

Videos, quizzes and interactive content designed for a proven learning experience

Unlimited access. Take your courses at your time and pace