PySpark: get related records from array column values (child values)

Completed Posted 3 years ago Paid on delivery

I have a Spark DataFrame with an ID column and, among other columns, an array column that holds the IDs of each record's related records.

An example DataFrame:

ID  | NAME | RELATED_IDLIST
--------------------------
123 | mike | [345,456]
345 | alen | [789]
456 | sam  | [789,999]
789 | marc | [111]
555 | dan  | [333]
893 | chad | null

From the above, I need to append all related child IDs, including the children of children, to the array column of each parent ID. The resulting DataFrame should look like this:

ID  | NAME | RELATED_IDLIST
--------------------------
123 | mike | [345,456,789,999,111]
345 | alen | [789,111]
456 | sam  | [789,999,111]
789 | marc | [111]
555 | dan  | [333]
893 | chad | null

I am trying to implement this in PySpark and need help figuring out the above requirement.
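One possible way to approach the requirement, offered as a minimal sketch rather than a confirmed solution: treat the ID to RELATED_IDLIST mapping as a graph and walk it breadth-first from each ID, which reproduces the ordering in the expected output above. The helper names `expand_related` and `add_all_related` are hypothetical, and the sketch assumes the IDs are integers and that the ID and RELATED_IDLIST columns are small enough to collect to the driver. For a large table, an iterative self-join would likely be a better fit.

```python
from collections import deque


def expand_related(graph):
    """Map each ID to all of its descendants in breadth-first order.

    `graph` maps ID -> list of directly related IDs (or None).
    A None list is kept as None, matching the expected output.
    """
    result = {}
    for start, children in graph.items():
        if children is None:
            result[start] = None
            continue
        seen = {start}          # guards against cycles and self-references
        ordered = []
        queue = deque(children)
        while queue:
            node = queue.popleft()
            if node in seen:
                continue
            seen.add(node)
            ordered.append(node)
            # IDs with no row of their own (e.g. 999) simply have no children
            queue.extend(graph.get(node) or [])
        result[start] = ordered
    return result


def add_all_related(df):
    """Hypothetical PySpark wiring; assumes integer IDs and a small lookup table."""
    # Imported lazily so expand_related can be tried without Spark installed.
    from pyspark.sql import functions as F, types as T

    rows = df.select("ID", "RELATED_IDLIST").collect()
    closure = expand_related({r["ID"]: r["RELATED_IDLIST"] for r in rows})
    lookup = F.udf(lambda i: closure.get(i), T.ArrayType(T.LongType()))
    return df.withColumn("RELATED_IDLIST", lookup(F.col("ID")))


# The example data from the question, as a plain dict.
graph = {
    123: [345, 456],
    345: [789],
    456: [789, 999],
    789: [111],
    555: [333],
    893: None,
}
expanded = expand_related(graph)
```

With the example data, `expanded[123]` comes out as `[345, 456, 789, 999, 111]` and `expanded[893]` stays `None`. Calling `add_all_related(df)` on the original DataFrame would then overwrite RELATED_IDLIST with the expanded lists.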

PySpark Spark Hadoop Python Scala

Project ID: #28253757

About the project

7 bids Remote project Active 3 years ago

Awarded to:

chaitayamvamsi

Hi, I have 4 years of experience as a big data engineer and have worked with PySpark for 3 years. I think I will be a good match. I can complete the work in 1 to 2 hours. Please message me. Regards, Vamsi

$28 USD / hour
(0 reviews)
0.0

7 freelancers bid an average of $20/hour on this project

CrazyProger

I'm a senior programmer in Python. I read the job details extremely carefully. I think I can help you complete your project with my rich knowledge and experience. I'm ready to do it now. Should you require furthe...

$20 USD / hour
(15 reviews)
5.2
nmogilip

Hi, I am a certified big data developer and have worked on this kind of requirement many times. Please let's connect and start ASAP. In the given example I didn't see the parent ID. Please share the actual data...

$22 USD / hour
(9 reviews)
4.4
diegoedunapo

Hi, how are you? I'm a data scientist specializing in R / Python / Spark / Hadoop / TensorFlow / SQL / Power BI. I need an example of the dataset and an example of the result you expect. I would like to know more in det...

$16 USD / hour
(1 review)
2.8
sirishkarthik

Hi, I'm a Spark developer, good at Python and PySpark. Please let me know if you are interested and we can discuss.

$17 USD / hour
(4 reviews)
2.2
Harsha3966

I can very well help you with this project. Here's a small intro about me: I am a professional data engineer with very good real-time experience working with most of the latest big data technologies like Scala, ...

$20 USD / hour
(0 reviews)
0.0