Skip to content
TheCodersStop
  • Home
  • Contact Us
  • About
TheCodersStop
TheCodersStop
  • Home
  • Contact Us
  • About

scala

by Pradeep Mishra
comments 0 date February 13, 2021

Generate Sequential and Unique IDs in a Spark Dataframe

Apache Spark is an open source, general-purpose distributed computing engine used for processing and analyzing a large amount of data. Hence, adding sequential and unique IDs to a Spark Dataframe is not very straight forward, because of distributed nature of it.

Share this:

  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • Click to share on WhatsApp (Opens in new window) WhatsApp
  • Click to share on Pinterest (Opens in new window) Pinterest
  • More
  • Click to share on Telegram (Opens in new window) Telegram
  • Click to email a link to a friend (Opens in new window) Email
  • Click to share on Reddit (Opens in new window) Reddit

Like this:

Like Loading...

Continue Reading
by Pradeep Mishra
comments 0 date August 16, 2020

Spark Partitions with Coalesce and Repartition (hash, range, round robin)

One main advantage of the Apache Spark is, it splits data into multiple partitions and executes operations on all partitions of data in parallel which allows us to complete the job faster.While working with partition data we often need to increase or decrease the partitions based on data distribution. Methods repartition and coalesce helps us to repartition.

Share this:

  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • Click to share on WhatsApp (Opens in new window) WhatsApp
  • Click to share on Pinterest (Opens in new window) Pinterest
  • More
  • Click to share on Telegram (Opens in new window) Telegram
  • Click to email a link to a friend (Opens in new window) Email
  • Click to share on Reddit (Opens in new window) Reddit

Like this:

Like Loading...

Continue Reading
by Pradeep Mishra
comments 4 date August 16, 2020

Scala String Interpolation

Introduction String Interpolation refers to substitution of defined variables or expressions in a given String with respected values. String Interpolation allows users to embed variable references directly in processed string literals.. read more…

Share this:

  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • Click to share on WhatsApp (Opens in new window) WhatsApp
  • Click to share on Pinterest (Opens in new window) Pinterest
  • More
  • Click to share on Telegram (Opens in new window) Telegram
  • Click to email a link to a friend (Opens in new window) Email
  • Click to share on Reddit (Opens in new window) Reddit

Like this:

Like Loading...

Continue Reading
by Pradeep Mishra
comments 0 date August 16, 2020

Multiline Strings in Scala

You want to create multiline strings within your Scala source code, like you can with the heredoc syntax of other languages and help in escaping quotes and other symbols. A heredoc is a way. read more…

Share this:

  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • Click to share on WhatsApp (Opens in new window) WhatsApp
  • Click to share on Pinterest (Opens in new window) Pinterest
  • More
  • Click to share on Telegram (Opens in new window) Telegram
  • Click to email a link to a friend (Opens in new window) Email
  • Click to share on Reddit (Opens in new window) Reddit

Like this:

Like Loading...

Continue Reading
Recent Posts
  • Schedule Cron Job with time zone in Openshift (Kubernetes)
  • Build a module with its dependencies in a multi-module Maven project
  • Schedule Jobs/Tasks Using Cron Expression in Spring with Example
  • Accessing Data with JPA
  • Java Message Service (JMS) programming model
Categories
  • Design (2)
  • Elasticsearch (3)
  • Golang (5)
  • hadoop (2)
  • Java (11)
  • Kubernetes (1)
  • linux (1)
  • Maven (1)
  • Openshift (1)
  • Performance (1)
  • Scala (2)
  • Security (3)
  • Spark (2)
  • Spring (2)
  • Spring Batch (1)
  • Spring Boot (1)
  • sqoop (1)
  • UI (1)
  • unix (1)
  • Vim (2)
Tags
annotation apache hive 3 cdp cloudera commands CORS cronjob design pattern DNS elastic elasticsearch go golang hadoop hdfs hive http ip java jms junit Kubernetes mq mysql nginx Openshift oracle proxy proxy_pass queue rdbms resolution resolver reverseproxy scala server spark spring springboot spring boot sqoop string timezone upstream vim
Recent Posts
  • Schedule Cron Job with time zone in Openshift (Kubernetes)
  • Build a module with its dependencies in a multi-module Maven project
  • Schedule Jobs/Tasks Using Cron Expression in Spring with Example
  • Accessing Data with JPA
  • Java Message Service (JMS) programming model
Categories
  • Design (2)
  • Elasticsearch (3)
  • Golang (5)
  • hadoop (2)
  • Java (11)
  • Kubernetes (1)
  • linux (1)
  • Maven (1)
  • Openshift (1)
  • Performance (1)
  • Scala (2)
  • Security (3)
  • Spark (2)
  • Spring (2)
  • Spring Batch (1)
  • Spring Boot (1)
  • sqoop (1)
  • UI (1)
  • unix (1)
  • Vim (2)
Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy
Tags
annotation apache hive 3 cdp cloudera commands CORS cronjob design pattern DNS elastic elasticsearch go golang hadoop hdfs hive http ip java jms junit Kubernetes mq mysql nginx Openshift oracle proxy proxy_pass queue rdbms resolution resolver reverseproxy scala server spark spring springboot spring boot sqoop string timezone upstream vim
Find Us

Address
123 Main Street
New York, NY 10001

Hours
Monday–Friday: 9:00AM–5:00PM
Saturday & Sunday: 11:00AM–3:00PM

TheCodersStop

©Copyright 2021. Built with ATTIRE

%d

    Terms and Conditions - Privacy Policy