stkim1 Profile

sung-taek, Kim

Join Date:

Blogs Owned

Personal BigData Cluster built on Raspberry PI. Tips, History, BigData Solution,

Visit Follow

Other Tags: BigData, Spark, RaspberryPI, Hadoop, MachineLearning

Latest Blog Posts

  • Weekly BigData & ML Roundup – Dec. 15, 2016
    on Dec 15, 2016 in Index
    Example Natural Language Processing in 10 Lines of Code PyCon 2016 workshop Natural Language Processing in 10 Lines of Code   Toolset Pinpoint Pinpoint is an open source APM (Application Performance Management) tool for large-scale distributed s...
  • Weekly BigData & ML Roundup – Dec. 8, 2016
    on Dec 8, 2016 in Index
    This week opens with Apache Drill updated to v1.9 and the releases of two open-source AI training platforms, Lab by OpenAI and Universe by DeepMind. Example Deepmind Learning to Learn Learning to Learn in TensorFlow Toolsets Appbaseio Gem GUI for D...
  • Weekly BigData & ML Roundup – Dec. 1, 2016
    on Dec 1, 2016 in Index has open-sourced its self-driving agent, OpenPilot. Examples OpenPilot Open source driving agent SnakeGame A classic game of snake that is controlled by a neural network and trained using a genetic algorithm Models Twitter AnomalyDe...
  • Weekly BigData & ML Roundup – Nov. 24, 2016
    on Nov 24, 2016 in Index
    Two eye-catching Machine-Learning libraries, PHP-ML and Skale-ML, written in PHP and Node.js respectively, are found in this week. Is this a sign of the up-coming wide-spread of ML everywhere? Special thanks to Nam Vu (@zuzoovn) for contributing his...
  • Weekly BigData & ML Roundup – Nov. 17, 2016
    on Nov 17, 2016 in DeveloperKit
    We have is pocket-full of goodies today! Airbnb Caraval is renamed to superset. Also, we have MLAlgorithm which offers a rare opportunity to “learn internals of ml algorithms or implement them from scratch” with simpler, easier codebase...
  • Travel to San Francisco
    on Aug 28, 2016 in DeveloperKit
    Hello folks, Due to a short trip to San Francisco for meetings, there won’t be round-ups for this week and next week. I hope your understanding.  ...
  • Weekly BigData & ML Roundup – Aug. 21, 2016
    on Aug 21, 2016 in Index
    It might change in the future, but there are few reasons to have only four categories up to date. It is a crude measure to separate all the great projects, but gives you a rough idea of what a project might be. Today, the fifth category, Model, is ad...
  • Weekly Roundup – Aug. 14, 2016
    on Aug 14, 2016 in Index
    Examples Facebook darkforestGo DarkForest, the Facebook Go engine DCGAN-Completion with Tensorflow Image Completion with Deep Learning in TensorFlow Spark Movie Lens An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset RN...
  • Weekly BigData Roundup – Aug. 7, 2016
    on Aug 8, 2016 in Index
    Microsoft releases a .Net framework for Apache Spark, named Mobius. Libraries Mobius C# language binding and extensions to Apache Spark Kafka-ETL-Consumer Kafka ETL consumes avro encoded data from Kafka and saves it to Parquet on HDFS NNPACK Accelera...
  • Weekly BigData Roundup – Aug. 1, 2016
    on Jul 31, 2016 in Index
    Facebook has opensourced a DeepLearning framework Torchnet. Here is a detailed blog post on the release. Libraries Ibis Productivity-centric Python data analysis framework for SQL systems and the Hadoop platform. Co-founded by the creator of pandas...
  • Weekly BigData Roundup – July 24, 2016
    on Jul 24, 2016 in Index
    Cisco has open-sourced Network and Services monitoring and analysis BigData framework PNDA project. Toolsets Clusterize.js Tiny vanilla JS plugin to display large data sets easily Google BigData-Interop Libraries and tools for interoperability betwee...
  • Weekly BigData Roundup – July 15, 2016
    on Jul 15, 2016 in Index
    We can pretty much sum up this week with two highlights; Amazon Scalable Tensor Network Engine DSSTNE and Yahoo Massively Parallel ADMM over Spark. Come check it out! Frameworks Amazon DSSTNE Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an...
  • Weekly BigData Roundup – July 11, 2016
    on Jul 10, 2016 in Index
    Starting from next week, Deep Learning stacks such as TensorFlow or Caffe and their related libraries and examples will appear as they become a vital part of machine learning pipeline where Big Data stacks are foundation. The main focus of roundup i...
  • Weekly BigData Roundup – July 1, 2016
    on Jun 30, 2016 in Index
    Apache has recently promoted Bahir and OODT as top projects. Metron is in incubation as well. Libraries Apache Bahir Apache Bahir provides extensions to distributed analytic platforms such as Apache Spark. MLDB MLDB is the Machine Learning Database F...
  • Weekly BigData Roundup – May 22, 2016
    on May 22, 2016 in Index
    Framework Brooklyn Apache Brooklyn helps to model, deploy, and manage systems. Libraries GraphFrames Users can write highly expressive queries by leveraging the DataFrame API, combined with a new API for motif finding. The user also benefits from Dat...
  • Weekly BigData Roundup – May 12, 2016
    on May 12, 2016 in Index
    This week’s roundup lists a cool new project, Sparkoin, a blockchain analyzer based on Apache Spark! Frameworks DistributedLog DistributedLog (DL) is a high-performance, replicated log service, offering durability, replication and strong consi...
  • Weekly BigData Roundup – May 5, 2016
    on May 5, 2016 in Index
    Libraries CoreNLP wrapper for Spark CoreNLP wraps Stanford CoreNLP annotation pipeline as an Apache Spark ML Transformer. Distributed Machine Learning Common Codebase A common bricks library for building scalable and portable distributed machine lear...
  • Six PINE64 have just arrived!
    on May 2, 2016 in DeveloperKit Update
    Back in December of last year, I spotted a cool Kickstarter project, PINE64. It was a new kid on the block and I gave it my try, actually a try of six packs.😉 Here today, they have arrived, with a Kickstarter welcome message, finally!  ...
  • Weekly roundup – Apr. 29, 2016
    on Apr 28, 2016 in Index
    Examples Spark GraphX Example Just some example of using GraphX. Spark Prodict Behavior Based On Past Activities This is an example of how to do window analysis with Spark. Spark Table Stats Example Simple Spark example of generating table stats for...
  • Weekly roundup – Apr. 22, 2016
    on Apr 22, 2016 in Index
    Examples Kite Apps Prescriptive Applications over Kite and Hadoop. Hadoop Mini Clusters Collection of Hadoop Mini Clusters. Spark Application Templates This repository contains basic Templates for Simple Spark Application, Simple SparkStreaming Appli...