How can you introduce the schema in a Row in Spark?


The type of data, field names, and field types in a table are defined by a schema, which is a structured definition of a dataset. In Spark, the structure of a row in a DataFrame is defined by its schema. A schema is essential for carrying out numerous tasks such as data filtering, joining, and querying.

Concepts related to the topic

  1. StructType: StructType is a class that specifies a DataFrame’s schema. Each StructField in its list corresponds to a field in the DataFrame (see the short sketch after this list).
  2. StructField: The name, data type, and nullable flag of a field in a DataFrame are all specified by the class called StructField.
  3. DataFrame: A distributed collection of data with named columns is called a DataFrame. It can be manipulated using different SQL operations and is similar to a table in a relational database.
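
As a quick sketch of how these pieces fit together (not part of the worked examples below; it assumes a SparkSession named spark already exists and uses made-up sample data), the same id/name/age schema can be built from StructField objects or passed to createDataFrame() as a DDL-formatted string:

Python3

# Sketch only: assumes an existing SparkSession called `spark`; the sample row is made up.
from pyspark.sql.types import StructType, StructField, IntegerType, StringType

# Schema built explicitly from StructField objects
struct_schema = StructType([
    StructField("id", IntegerType(), True),    # nullable integer field
    StructField("name", StringType(), True),   # nullable string field
    StructField("age", IntegerType(), True)
])

# Equivalent schema written as a DDL-formatted string
ddl_schema = "id INT, name STRING, age INT"

df = spark.createDataFrame([(1, "Asha", 30)], schema=ddl_schema)
df.printSchema()   # prints the same structure described by struct_schema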

Example 1:

Step 1: Load the required libraries and functions and create a SparkSession object

Python3

from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, IntegerType, StringType
from pyspark.sql import Row

spark = SparkSession.builder.appName("Schema").getOrCreate()
spark

Output:

SparkSession - in-memory
SparkContext

Spark UI
Version
v3.3.1
Master
local[*]
AppName
Schema

Step 2: Define the schema

Python3

schema = StructType([
    StructField("id", IntegerType(), True),
    StructField("name", StringType(), True),
    StructField("age", IntegerType(), True)
])

Step 3: List of employee data with 5 row values

Python3

data = [[101, "Sravan", 23],
        [102, "Akshat", 25],
        [103, "Pawan", 25],
        [104, "Gunjan", 24],
        [105, "Ritesh", 26]]

Step 4: Create a DataFrame from the data and the schema, and print the DataFrame

Python3

df = spark.createDataFrame(data, schema=schema)
df.show()

Output:

+---+------+---+
| id|  name|age|
+---+------+---+
|101|Sravan| 23|
|102|Akshat| 25|
|103| Pawan| 25|
|104|Gunjan| 24|
|105|Ritesh| 26|
+---+------+---+

Step 5: Print the schema
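
A minimal snippet for this step, calling printSchema() on the df created in Step 4:

Python3

# Print the schema of the DataFrame created in Step 4
df.printSchema()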

Output:

root
 |-- id: integer (nullable = true)
 |-- name: string (nullable = true)
 |-- age: integer (nullable = true)

Step 6: Stop the SparkSession
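
A minimal snippet for this step:

Python3

# Stop the SparkSession and release its resources
spark.stop()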

Example 2:

Steps needed

  1. Create a StructType object defining the schema of the DataFrame.
  2. Create a list of StructField objects representing each column in the DataFrame.
  3. Create a Row object by passing the values of the columns in the same order as the schema.
  4. Create a DataFrame from the Row object and the schema using the createDataFrame() function.

Creating a DataFrame with multiple columns of different types using a schema.

Python3

from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, IntegerType, StringType
from pyspark.sql import Row

spark = SparkSession.builder.appName("example").getOrCreate()

schema = StructType([
    StructField("id", IntegerType(), True),
    StructField("name", StringType(), True),
    StructField("age", IntegerType(), True)
])

row = Row(id=100, name="Akshat", age=19)

df = spark.createDataFrame([row], schema=schema)

df.show()

df.printSchema()

spark.stop()

Output

+---+------+---+
| id|  name|age|
+---+------+---+
|100|Akshat| 19|
+---+------+---+

root
 |-- id: integer (nullable = true)
 |-- name: string (nullable = true)
 |-- age: integer (nullable = true)
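
As a small follow-up sketch (not part of the original steps; these calls must run before spark.stop()), the rows of such a DataFrame can be read back and their fields accessed by name:

Python3

# Sketch only: run before spark.stop() in the example above.
first = df.first()                # Row(id=100, name='Akshat', age=19)
print(first["name"], first.age)   # Akshat 19
print(df.schema)                  # the StructType used to build the DataFrame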

Last Updated: 09 Jun, 2023
