Excel BI - Excel Challenge 750

excel-challenges

excel-formulas

🔰 Data Answer Expected Emp ID Name Age Robert Anne Lisa Align the data as shown.

Published

March 24, 2026

Illustration for Excel BI - Excel Challenge 750

Challenge Description

🔰 Data Answer Expected Emp ID Name Age Robert Anne Lisa Align the data as shown.

Solutions

library(tidyverse)
library(readxl)

path = "Excel/700-799/750/750 Data Alignment.xlsx"
input = read_excel(path, range = "A1:A21")
test  = read_excel(path, range = "C2:E6")

result = input %>%
  mutate(group = cumsum(str_detect(Data, "Emp ID"))) %>% 
  mutate(cn = ifelse(row_number() > n() / 2, "values", "labels"), .by = group) %>%
  pivot_wider(names_from = cn, values_from = Data) %>%
  unnest() %>%
  pivot_wider(names_from = labels, values_from = values) %>%
  select(-group) %>% 
  mutate(across(c("Age", "Emp ID"), as.numeric)) 

all.equal(result, test)
# > [1] TRUE

Logic: Read the workbook ranges needed for the challenge; Derive the required intermediate columns; Aggregate or rank the data at the required grouping level; Reshape the result into the workbook output format.
Strengths: The reshaping step mirrors the workbook output closely instead of forcing extra post-processing.
Areas for Improvement: The solution assumes the workbook layout and selected ranges remain stable, so any structural change in the sheet would require small adjustments.
Gem: The last reshape turns a raw transformation into something that already looks like a report.

import pandas as pd
import numpy as np
import re

path = "700-799/750/750 Data Alignment.xlsx"
input = pd.read_excel(path, usecols="A", nrows=21).astype(str)
test = pd.read_excel(path, usecols="C:E", skiprows=1, nrows=4).rename(columns=lambda x: re.sub(r'\.1$', '', x))

input['group'] = input['Data'].str.contains("Emp ID").cumsum()
input['cn'] = np.where(input.groupby('group').cumcount() < input.groupby('group')['Data'].transform('count') // 2, 'labels', 'values')

labels_rows, values_rows = (input[input['cn'] == x] for x in ['labels', 'values'])

wide = pd.DataFrame({'labels': labels_rows['Data'].values, 'values': values_rows['Data'].values})
wide['group'] = wide['labels'].str.contains("Emp ID").cumsum()

pivoted = wide.pivot(index='group', columns='labels', values='values')
wide = pivoted.reset_index()
result = wide[['Emp ID', 'Name', 'Age']].copy()
result['Emp ID'] = result['Emp ID'].astype(int)
result['Age'] = result['Age'].astype(float)

print(result.equals(test)) # True

The Python version follows the same grouped logic and keeps the transformation explicit in a dataframe pipeline.

Difficulty Level

Medium

The individual steps are manageable, but the correct transformation pattern is not obvious from the raw data.