Description |
1 online resource (1 streaming video file (1 hr., 32 min., 57 sec.)) : digital, sound, color |
Summary |
"The course is designed for engineers and data scientists who have some familiarity with Scala, Apache Spark, and machine learning who need to process large natural language text in a distributed fashion.We will use sample of posts from the subreddit /r/WritingPrompts, which contains short stories and comments about the short stories.The course has four parts1. Building a natural language processing and entity extraction pipeline on Scala & Spark2. Machine Learning Applications for Statistical Natural Language Understanding at Scale3. Topic Modeling on Natural Language with Scala, Spark and MLLib4. Deep Learning Applications for Natural Language Understanding with Scala, Spark and MLLibYou will learn how use Apache Spark to process text with annotations, use machine learning with your annotations, create and use topic models, create and use a word2vec model."--Resource description page |
Notes |
Title from title screen (viewed January 25, 2017) |
Performer |
Presenters, David Talby, Alex Thomas |
SUBJECT |
Spark (Electronic resource : Apache Software Foundation) http://id.loc.gov/authorities/names/no2015027445
|
|
Spark (Electronic resource : Apache Software Foundation) fast (OCoLC)fst01938143 |
Subject |
Natural language processing (Computer science)
|
|
Data mining.
|
|
Electronic data processing -- Distributed processing.
|
|
Natural Language Processing
|
|
Data Mining
|
Form |
Streaming video
|
Author |
Thomas, Alex, on-screen presenter
|
|