Page 2 out of 15 results
Sort by
Newest Mastering Spark with R summaries
![Week 15](/docpics/3071296/65d696ba7bc7c_3071296_121_171.jpeg)
-
Week 15
- Summary • 3 pages • 2023
-
datascience24
-
- $10.49
- + learn more
Using the code provided in section 4.5.1 AND 4.5.2, create an LDA model of the essays in the okc dataset. Create charts of most common terms per topic. 
ANSWER THE FOLLOWING QUESTIONS: 
1. Explain how the html tags and newline characters were removed from the text. 
2. How were the individual words combined into complete essays? 
3. Explain what an LDA model does. 
4. What are stop words? 
5. In which 2 topics is the word 'want' not in the top ten? 
6. What code snippet causes the topics...
![Week 9](/docpics/3071262/65d69641a4d00_3071262_121_171.jpeg)
-
Week 9
- Summary • 3 pages • 2023
- Available in package deal
-
datascience24
-
- $10.49
- + learn more
The text is Mastering Spark with R. 
Using your sc connection and the correct Java version: 
1. explain what each element is doing in the following code blocks and the difference between these two code blocks that explains the difference in the output. Explain what the output tables mean: 
>cars %>% 
 + ml_linear_regression(mpg ~ .) %>% 
 + summary() 
>cars %>% 
 + ml_linear_regression(mpg ~ hp + cyl) %>% 
 + summary() 
cars %>% 
 + ml_linear_regression(mpg ~ ...
![Week 2](/docpics/3071227/65d6959061d70_3071227_121_171.jpeg)
-
Week 2
- Summary • 3 pages • 2023
- Available in package deal
-
datascience24
-
- $10.49
- + learn more
Following the instrutions in the book, install the sparklyr library and a Spark cluster on your local machine. The instructions in the book contemplate that you will be using Windows. If you are not running Windows, use a virtual machine. If you are using your employer's equipment, you may enounter a trusted domain error. This will also require that you install a virtual machine. The code in the book assumes Spark version 2.3 and Java 8. You are free to use any version, of course, but the co...
Do you also write study notes yourself? Put them up for sale and earn every time your document is purchased.
![Week 3](/docpics/3071235/65d695938f6be_3071235_121_171.jpeg)
-
Week 3
- Summary • 2 pages • 2023
- Available in package deal
-
datascience24
-
- $10.49
- + learn more
The text is Mastering Spark with R. 
After installing the dplyr library, use the following code in R studio, create a scatter plot of hp v. mpg 
 
cars2 <- copy_to(sc, mtcars) 
cars3 <- (cars2) 
cars4 <- select(cars3, hp, mpg) 
cars5 <- sample_n(cars4, 32) 
cars6 <- collect(cars5) 
plot(cars6)
![Week 1](/docpics/3071223/64b310db5d4cd_3071223_121_171.jpeg)
-
Week 1
- Summary • 1 pages • 2023
- Available in package deal
-
datascience24
-
- $10.49
- + learn more
Submit a Word or .pdf document with screen shots of R Studio where you have created a vector of three words. Whenever you are asked to submit a screenshot, include either a sliver of your desktop or a timestamp from your desktop. Always repeat the question you are answering.
![Verkoop je kennis op stuvia](https://www.stuvia.com/hosted-imgs/app/stock-fotos/banner_seller_big.jpg)
Study stress? For sellers on Stuvia, these are actually golden times. KA-CHING! Earn from your study resources too and start uploading now. Discover all about earning on Stuvia