プロポーザル

これは応募されたプロポーザルです。聞きたいと思うプロポーザルを各ページの下部にあるSNSのボタンで拡散しましょう。拡散された投稿をプロポーザルへの投票としてカウントし、選考時に参考にさせていただきます。

talk

Running Dask in the Cloud(en)

スピーカー

Shane Cousins

対象レベル:

中級

カテゴリ:

Distributed Computing

説明

Is pandas running out of memory? Have you wanted to more easily process large scale data? dask may be what your looking for. Here I'll introduce the dask framework and talk about how you can get it running in the cloud

目的

Define what is Dask Explain why you'd want to use it. Learn how to get started setting up a dask distributed computing cluster

概要

Dask is a general purpose Spark-like big data computing framework that allows you to take advantage of Numpy/Pandas/Scikit-learn level complex algorithms, written in Pure Python. This talk provides a brief introduction of dask and focuses on how a dask cluster can be setup in the cloud to get you started with migrating your pandas/numpy workflows to more easily work with and process larger datasets.
  • このエントリーをはてなブックマークに追加
CONTACT