en
Книжки
Luis Pedro Coelho

Building Machine Learning Systems with Python – Second Edition

Using machine learning to gain deeper insights from data is a key skill required by modern application developers and analysts alike. Python is a wonderful language to develop machine learning applications. As a dynamic language, it allows for fast exploration and experimentation. With its excellent collection of open source machine learning libraries you can focus on the task at hand while being able to quickly try out many ideas.
This book shows you exactly how to find patterns in your raw data. You will start by brushing up on your Python machine learning knowledge and introducing libraries. You'll quickly get to grips with serious, real-world projects on datasets, using modeling, creating recommendation systems. Later on, the book covers advanced topics such as topic modeling, basket analysis, and cloud computing. These will extend your abilities and enable you to create large complex systems.
With this book, you gain the tools and understanding required to build your own systems, tailored to solve your real-world data analysis problems.
541 паперова сторінка
Рік виходу видання
2015
Видавництво
Packt Publishing
Уже прочитали? Що скажете?
👍👎

Враження

  • aakononoffділиться враженням7 років тому
    👍Раджу

    Книга знакомит с основными концепциями машинного обучения. Так же есть примеры работы с jug - фреймворк для выполнения задач в нескольких потоках, AWS - сервис от амазона.

Цитати

  • Zaur Huseynovцитує4 роки тому
    But before you go there, you will have to define what you actually mean by "better". SciKit has a complete package dedicated only to this definition. The package is called sklearn.metrics and also contains a full range of different metrics to measure clustering quality. Maybe that should be the first place to go now. Right into the sources of the metrics package.
  • Zaur Huseynovцитує4 роки тому
    SciKit provides a wide range of clustering approaches in the sklearn.cluster package. You can get a quick overview of advantages and drawbacks of each of them at http://scikit-learn.org/dev/modules/clustering.html.
  • Zaur Huseynovцитує4 роки тому
    UCI Machine Learning Dataset Repository

    The University of California at Irvine (UCI) maintains an online repository of machine learning datasets (at the time of writing, they list 233 datasets). Both the Iris and the Seeds dataset used in this chapter were taken from there.

    The repository is available online at http://archive.ics.uci.edu/ml/.

На полицях

fb2epub
Перетягніть файли сюди, не більш ніж 5 за один раз