Tag: Programming
All the articles with the tag "Programming".
-
Splitting Strings in Excel: A Simple Guide
When working with Excel, you may encounter situations where you need to split a string into separate parts. For example, consider the following string: orderId: 12345abc-de67-89fg-hijk-123456lmnop If
-
Understanding How Hive Converts SQL Queries into Hadoop Jobs
When you execute a SQL query in Apache Hive, the query does not run directly on a traditional database engine. Instead, Hive translates it into one or more Hadoop jobs, which are then executed across a distributed cluster.
-
Tracking Daily File Size Changes in SQL
When working with databases that store file metadata, it's often useful to track how file sizes change over time. If you have a table with the following structure: id | timestamp | name_file | size
-
Merging Data in PostgreSQL vs. MySQL: How to Handle Upserts
When working with databases, you often need to update existing records or insert new ones based on whether a match is found. In PostgreSQL (version 15 and later), this can be handled using the MERGE statement.
-
Optimizing Queries with Partitioning in Databricks
Partitioning is a crucial optimization technique in big data environments like Databricks. By partitioning datasets, we can significantly improve query performance and reduce computation time. This
-
Calculating Levenshtein Distance in Apache Spark Using a UDF
When working with text data in big data environments, measuring the similarity between strings can be essential. One of the most commonly used metrics for this is the Levenshtein distance, which
-
Creating a PySpark DataFrame for Sentiment Analysis
When working with sentiment analysis, having structured data in a PySpark DataFrame can be very useful for processing large datasets efficiently. In this post, we will create a PySpark DataFrame
-
Automating Payment Calculation in Google Docs Using Apps Script
Google Apps Script is a powerful tool that allows you to automate tasks within Google Workspace applications, such as Google Docs. In this tutorial, we will create a script that prompts