Analysing tweets in R

Completed · Posted 4 years ago · Paid on delivery

The work must be done in R. You may choose any company on Twitter, for example Apple, Samsung or Microsoft.

1. Use the rtweet library to download 1000 tweets that the company posted. Save these tweets as "[login to view URL]".

2. Use the rtweet library to download 1000 tweets about the company you selected. Save these tweets as "[login to view URL]".

3. Examine the source column of both the company and the public tweets to see the source of tweets. Find out how many different levels of sources exist in the public and company tweets.

4. Draw a bar plot of the top 10 most frequent tweet sources for both company tweets and the public tweets. Label each bar with the source name.

5. Comment on your bar plots.

6. Using an appropriate statistical test, test whether retweeting is independent of the tweet source in the public tweets. Use the "source" and "is_retweet" columns to get the source and retweet information. Group the sources as: "Salesforce - Social Studio", "Twitter for Android", "Twitter for iPad", "Twitter for iPhone", "Twitter Web App", "Twitter Web Client" and "Other".

7. What is the conclusion of the test? Interpret your results.

8. Calculate a 95% confidence interval for the mean text width of the tweets that the company posted. Use the "display_text_width" column to get this information.

9. Combine "[login to view URL]" and "[login to view URL]" and save as "tweets".

10. Clean and pre-process the data (use TF-IDF weights in your analysis).

11. Compute the most appropriate number of clusters using the elbow method for the combined "tweets" by using cosine distance.

12. Cluster the tweets using the most appropriate clustering method.

13. Visualize your clustering in 2-dimensional vector space. Show each cluster in a different colour and the tweets in "[login to view URL]" and "[login to view URL]" with different symbols in your visualization.

14. Comment on your visualization.

15. Compute the proportion of "[login to view URL]" in each cluster. Print these proportions.

16. Which clusters are dominated by the public and which are dominated by the company?

17. Draw a word cloud and a dendrogram of these two clusters to understand the theme of the clusters.
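
Steps 10 and 11 can be sketched in base R as follows. This is only an illustration under stated assumptions: the `docs` vector is made-up sample text standing in for the combined tweets, the tokeniser is deliberately crude, and k-means on unit-length TF-IDF rows is used as a stand-in for clustering by cosine distance (for unit vectors, squared Euclidean distance equals twice the cosine distance). It is not necessarily the method the client has in mind.

```r
# Toy corpus standing in for the combined "tweets" (hypothetical data).
docs <- c(
  "new phone launch today with better camera",
  "our new phone camera is amazing",
  "battery life on this phone is terrible",
  "love the battery life after the update",
  "software update rolling out to all users",
  "update broke my laptop software again",
  "visit our store for laptop deals",
  "great deals at the store this weekend"
)

# Crude pre-processing: lower-case, keep letters only, split on whitespace.
clean  <- gsub("[^a-z ]", " ", tolower(docs))
tokens <- lapply(strsplit(clean, "\\s+"), function(x) x[nzchar(x)])

# Document-term matrix of raw term frequencies (rows = documents).
vocab <- sort(unique(unlist(tokens)))
tf <- t(sapply(tokens, function(tk) as.vector(table(factor(tk, levels = vocab)))))
colnames(tf) <- vocab

# TF-IDF weights: term frequency times log(N / document frequency).
idf   <- log(nrow(tf) / colSums(tf > 0))
tfidf <- sweep(tf, 2, idf, "*")

# Drop documents whose weights are all zero, then scale rows to unit length.
tfidf <- tfidf[rowSums(tfidf^2) > 0, , drop = FALSE]
unit  <- tfidf / sqrt(rowSums(tfidf^2))

# Cosine distance matrix: 1 minus the dot product of unit-length rows.
cos_dist <- 1 - unit %*% t(unit)

# Elbow method: total within-cluster sum of squares for k = 1..5.
set.seed(42)
wss <- sapply(1:5, function(k) kmeans(unit, centers = k, nstart = 10)$tot.withinss)
plot(1:5, wss, type = "b", xlab = "Number of clusters k",
     ylab = "Total within-cluster sum of squares")
```

The "elbow" is the value of k after which the curve flattens. With real tweets the range of k would need to be wider than 1 to 5, and `cos_dist` could instead be passed to `cmdscale()` for the 2-dimensional plot in step 13.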

We are unsure whether friending leads to an increase in popularity. To examine this (you can use the twitteR package in this section), we will:

18. Find the 10 most popular friends of the chosen Twitter handle.

19. Obtain a 1.5-degree egocentric graph centred at the chosen Twitter handle and plot the graph. The egocentric graph should contain the 10 most popular friends of the chosen Twitter handle.

20. Compute the betweenness centrality score for each Twitter handle in our graph. List the top 3 most central people in your graph according to the betweenness centrality.

21. Comment on your results.
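
Steps 19 and 20 can be sketched with the igraph package on a made-up edge list. Everything here is an assumption for illustration: the handles are hypothetical, the edges would in practice come from friend lists fetched through twitteR or rtweet, and igraph itself is not named in the brief. A 1.5-degree egocentric graph holds the ego, its friends, and the follow links among those friends.

```r
library(igraph)

# Hypothetical handles: the ego and four of its friends.
ego_handle <- "company"
friends    <- c("alice", "bob", "carol", "dave")

# Degree-1 part: the ego follows each of its friends ("from" follows "to").
ego_edges <- data.frame(from = ego_handle, to = friends)

# The extra "half degree": follow links among the friends themselves.
friend_edges <- data.frame(
  from = c("alice", "bob", "carol", "dave"),
  to   = c("bob", "carol", "alice", "alice")
)

# Build the directed graph from the combined edge list and plot it.
g <- graph_from_data_frame(rbind(ego_edges, friend_edges), directed = TRUE)
plot(g, vertex.size = 25, edge.arrow.size = 0.4)

# Betweenness centrality per handle, highest first; keep the top 3.
btw  <- sort(betweenness(g, directed = TRUE), decreasing = TRUE)
top3 <- head(btw, 3)
print(top3)
```

In this toy graph the ego only has outgoing links, so no shortest path passes through it. On real data the ranking depends on which follow links exist among the friends.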

R Programming Language

Project ID: #21587852

About the project

5 bids · Remote project · Active 4 years ago

Awarded to:

faiiz

Hi Sir, I am interested in this project. My name is Faiz and I'm a statistician and a professional data scientist, a combination that provides me with the full suite of tools to do data analysis, data modelling and data…

$15 AUD in 1 day
(2 reviews)
2.1

5 freelancers bid an average of $347 on this project

usmanhassan123

I have a Master's in computer science and an MPhil in statistics. I am a professional data analyst. I have already done more than 150 statistical analysis projects in my career using different tools like: R programming, SP…

$400 AUD in 1 day
(21 reviews)
4.6

qaisrani123

I am already working on a similar project and am interested in working with you. Hi, I am a data scientist and have good experience in Python and R programming. My area of interest i…

$70 AUD in 5 days
(8 reviews)
3.8

suyashdhoot

Hi, I am a very experienced statistician, data scientist and academic writer. I have completed several PhD-level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp…

$1000 AUD in 7 days
(19 reviews)
5.1

Softeria

I have an MS in Software Engineering. I have 5 years' experience in data engineering, ML and artificial intelligence. I know all data mining techniques, deep learning and data analysis techniques. I have worked on K-me…

$250 AUD in 7 days
(1 review)
2.2