Find Jobs
Hire Freelancers

Looking for someone who is good in Data Science and vector representation

$30-250 USD

Zamknięte
Opublikowano prawie 6 lat temu

$30-250 USD

Płatne przy odbiorze
GloVe that relies on a different algorithmic principle but also yields vector representations of words. Within the text file inside the archive, each line contains a word followed by a space and then a series of floating point numbers (also space-separated). The floating point numbers for a word (300 in total) constitute the word vector representation in a 300-dimensional word vector space. Please write code to achieve the following tasks and report the results. Do not use any libraries for the nearest neighbor computation, but instead write your own code for this. You may use any programming language (it is easy to store and manipulate a 300-dimensional array of floating point numbers in almost any programming language). --> Task 1 Determine the 5 nearest neighbours of your first name in terms of the cosine similarity measure, along with the respective cosine similarity scores. For each neighbour, list the word/name, not the vector. Note that you may need to lower-case your name to find it (e.g. “nicole” instead of “Nicole”). If (and only if) your first name is genuinely not covered by the word vector data, then report this fact and use the first name of a celebrity instead. --> Task 2 Write code to create a vector representation for an entire sentence simply by taking the average of all word vectors for words in that sentence. This involves 1) tokenizing a sentence, i.e., splitting it into words, for which you may use a very na¨ıve and imperfect method. Then 2) look up the word vectors for those tokens. Make sure to apply lower-casing if necessary. You may ignore tokens that are not covered by the vocabulary of the word vectors. Finally, 3) take the average, i.e. compute the component-wise sum of the word vectors, and then divide each component by the number of words in the sentence that were covered by the data. Next, choose a random sentence S0 and compute the vector representation of that sentence using the above method. List the nearest neighbour words to that sentence vector (i.e., determine which words in the data have a similar vector representation to the vector for the sentence). --> Task 3 Choose two other sentences S1 and S2 such that S1 is similar in meaning to S0, and S2 is dissimilar in meaning to S0. Create the sentence vectors using the method from Task 2, and report the cosine similarities between the vectors for S0 and S1, and between the vectors for S0 and S2. Explain whether the obtained cosine similarity scores are reasonable and give a brief explanation of why or why not.
Identyfikator projektu: 16766414

Informację o projekcie

5 ofert
Zdalny projekt
Aktywny 6 lat temu

Szukasz sposobu na zarobienie pieniędzy?

Korzyści ze składania ofert na Freelancer.com

Ustal budżet i ramy czasowe
Otrzymuj wynagrodzenie za swoją pracę
Przedstaw swoją propozycję
Rejestracja i składanie ofert jest bezpłatne
5 freelancerzy składają oferty o średniej wysokości $156 USD dla tej pracy
Awatar Użytkownika
Have worked with vector representations like word2vec, glove, fasttext etc. before for NLP project. Thanks for the complete description, would be able to do this task within 3 days. Looking forward to hearing back from you.
$180 USD w 3 dni
5,0 (53 opinii)
6,6
6,6
Awatar Użytkownika
Hello there, my name is Daniel and I would love to help you out with this project. I am very familiar with machine learning algorithms and NLP, so I have worked in the past with word vectors plenty of times before. I have completely read your requirements and I am confident I can achieve what you need. Hope to hear back from you soon. Thanks
$210 USD w 5 dni
5,0 (29 opinii)
6,0
6,0
Awatar Użytkownika
Greetings of the day! I am the best fit to your requested requirement. I can help you in Data Science and vector representation. My Expertise & Experience: I am skilled in Algorithm, Data Science, Machine Learning, Python, Vectorization. I would love to work with you on this project. Carrying total experience of 6.5 years. I can assure you, you won't disappointed with class of the work I produce. I can start the work on the go as per your need! Communication: I will be available on the freelancer.com chat 18 hours in the day. (FLEXIBLE WITH TIMINGS IN CASE OF THE NEED) I am just a message away in case of any queries.
$250 USD w 3 dni
4,9 (36 opinii)
5,9
5,9
Awatar Użytkownika
Hi, I am good at machine learning and data science and I am proficient in Python. I have a research degree in m/c learning from IIT Madras. Please consider and have a good day. Regards
$111 USD w 3 dni
0,0 (0 opinii)
0,0
0,0

O kliencie

Flaga UNITED STATES
Piscataway, United States
5,0
10
Zweryfikowana metoda płatności
Członek od gru 1, 2017

Weryfikacja Klienta

Dziękujemy! Przesłaliśmy Ci e-mailem link do odebrania darmowego bonusu.
Coś poszło nie tak podczas wysyłania wiadomości e-mail. Proszę spróbować ponownie.
Zarejestrowani Użytkownicy Całkowita Liczba Opublikowanych Projektów
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Wczytywanie podglądu
Udzielono pozwolenia na Geolokalizację.
Twoja sesja logowania wygasła i zostałeś wylogowany. Proszę, zalogować się ponownie.