*Links:*

Count values in one column based on another column in DAX

I want to create a calculated column which will contain the sum of occurrences of values in Column A based on the values in Column B.

**Example 1**

**using EARLIER (as a new column)**

**Example 2**

**using FILTER(ALL ()(as a new column)**

**Example 3**

**Earlier in measure**

Earlier function needs 2 nested row contexts to work with. You need to get this function 2 tables. It is not working just with 1 table in a measure.

Occurence =

VAR

RowCount = Sumx(CategoryData,

COUNTROWS(FILTER(ALL(CategoryData),

EARLIER(CategoryData[Subcategory])=CategoryData[Category])))

RETURN

RowCount

**Calculate SUM for a single item**

Salary for Maria = CALCULATE( SUM(DimCustomer[Salary]), FILTER(VALUES(DimCustomer),DimCustomer[Customer]="Maria"))

**Partitioning data by month**

__Calculating average per month.__

Step A

calculating sum per month.

I have partitioned data by Year_month and using ALLEXCEPT(FactSales,FactSales[Year_month])) to calculate sum per month.

Step B

calculating number of transaction per month. Partitioning data by Year_month, using ALLEXCEPT(FactSales,FactSales[Year_month])), the same logic as in step A.

Step C

Calculating average by dividing sum of the group by the number of its data points. Data on the chart are sorted by the Index column, which is in the tooltip.

MEAN By Year_Month =

VAR s =

CALCULATE (

SUM ( FactSales[Sales] ),

ALLEXCEPT ( FactSales, FactSales[Year_month] )

)

VAR d =

CALCULATE (

COUNTROWS ( FactSales ),

ALLEXCEPT ( FactSales, FactSales[Year_month] )

)

RETURN

DIVIDE ( s, d, 0 )

**Calculate running total just for the first 15 days **

*FIRST 15 DAYS CUM_SALE =*

VAR Period =

DATESBETWEEN ( tblSales[Date], DATE (2021, 01, 01), DATE (2021, 1, 15))

VAR _RESULT =

CALCULATE (

SUM ( tblSales[Sales] ),

DATESYTD ( tblSales[Date], "2021-01-15"),

Period

)

RETURN _RESULT

**Calculate running total for selected days.**

*Step 1*

create measure that calculates the running total

*Total Sales** = **CALCULATE**(SUM(tblSales[Sales]),**DATESMTD**(tblSales[Date]))*

Step 2

*create a new table with selected dates *

*HelpTable =*

CALCULATETABLE (

ADDCOLUMNS (VALUES ( tblSales[Date] ), "Total sales", [Total Sales] ),

tblSales[Date] >= DATE (2021, 01, 03)

&& tblSales[Date] <= DATE (2021, 01, 10))

*Step 3*

Create a **relationship **between tblSales and HelpTable.

*Step 4*

*Create a measure with the SUM() function.*

**New Customer Analysis**

*This measure calculates the total number of the customers per year. In 2020 there was 6 distinct customer, in 2021, 10 distinct customer. I want to find out the number of the new customers in 2021, means a **new customer is every customer having no record in 2020*

*Number of Customers per year =*

CALCULATE (

DISTINCTCOUNT ( FactSales[CustomerID] ),

ALLEXCEPT ( FactSales, FactSales[Date].[Year] ))

Step 1

I want to create a measure which will contain these two tables as a virtual tables.

<all customers> table returns all the customers from 2020-2021.

<2020 Customers> table returns just the customers who have some record in 2021.

Then I use the EXECPT() function to retrieve just those customers who occur only in 2021 table.

Step 2

*NEW CUSTOMERS =*

VAR ALL_CUSTOMER = VALUES ( FactSales[CustomerID] )

VAR CUSTOMER_PRIOR_YEAR =

CALCULATETABLE (

VALUES ( FactSales[CustomerID] ),

ALL ( FactSales[Date] ),

DATESBETWEEN ( FactSales[Date], DATE (2020, 01, 01), DATE (2020, 12, 31))

)

VAR RESULT =

COUNTROWS (EXCEPT (ALL_CUSTOMER, CUSTOMER_PRIOR_YEAR))

RETURN RESULT

this is how the table looks like when is filtered by 2021 year.

**Cumulative percentage from delivery days**

*Sample data from Contoso database:*

**Calculating an average of delivery days.**

**Calculating number of distinct orders per delivery days**

**Calculating the grand total of all distinct order.**

**Calculating a ratio of number deliveries per delivery duration to its grand total.**

85% of all deliveries are delivered within the first 4 days. If I order an item on Wednesday morning, on Wednesday or Thursday night will be processed and there is an 85% probability that on Friday my item will be delivered, speaking theoretically.

**Calculating a cumulative percentage of all orders delivered up to 4-day.**
This measure shows a proportion of all orders delivered within 4 days cumulatively.

Creating a **disconnected Days table.** This table will be used in a** ****matrix visual in columns**. I going to iterate over these values, use this table to calculate the cumulative percentage.
SELECTCOLUMNS function takes a table and columns from an existing table. It contains an option to specify our own names (you can rename the original column names) for the newly created / extracted table.

this shows how to use that table with **Up-to-4-day Delivery **measure, put it into columns in matrix.

The next step is to grab the value from **Days table **and use it in a measure.

We can use **SELECTEDVALUE **function which supplies the actual values from **Days table**.

The result we are looking at is displaying the values cumulatively. If we look at value **4** for** Australia** in the first row, we can conclude that up to 4 days 84.3% of deliveries are completed.

Days 6 and 7 are missing because I did not add these values to my table. But you can of course.

As you can see, we got repeating **100% values** we must take care of. And, as you can see **CurrentDaysValue - 1 **did not fix this problem.

**The final measure:**

Cumulative Percentage =

VAR

CurrentDaysValue = SELECTEDVALUE(Days[Days])

VAR

Process =

CALCULATE([Order-to-All Orders Ratio],

FactSales[Duration] <= CurrentDaysValue)

VAR

MaxCurrentDay = CALCULATE(MAX(Days[Days]),Days[Days] < CurrentDaysValue)

VAR

COPYOF_Process =

CALCULATE([Order-to-All Orders Ratio],

FactSales[Duration] <= MaxCurrentDay)

RETURN

IF(COPYOF_Process < 1 , Process)

variable MaxCurrentDay was added

**Adding a conditional formating for values**

CumulativePercFormatting =

var TargetTable =

CALCULATE( ROUND( [Cumulative Perc], 4),

REMOVEFILTERS(), --remove filters.

VALUES(DeliveryTime[Days])) --restore filters for columns in Matrix Vis.

var TargetValue = ROUND([Cumulative Perc],4)

var Result = IF ( TargetValue < TargetTable, -1, +1 )

return

Result

Values below the grand total average are formatted red.

**Grouping a table using SUMMARIZECOLUMN function**

Sample data: Using tables Order and OrderDetails from Northwind database. Tables are joined into one table. I've added a Year column for groupping.

I want to create a forecast for 1998 year. This year finish at 06/05/1998. Each year has 4 quarters. I calculate the sum of 1998 and substract the value which is over Q1. I will get the net value for Q1 and then I multiply that value by 3 assuming all 4 quarters will have the same total.

**Groupping the columns**

OrdersForCAGR =

var TargetTable =

ADDCOLUMNS(

ADDCOLUMNS(

SUMMARIZECOLUMNS(Orders[Year], "Total",SUM(Orders[Total Sales])),

"Quarterly Coefficient", CALCULATE(1/(4/12))),

"Sales Total", IF(Orders[Year] = 1998,

CALCULATE([Sales] * (1/(4/12))) - [Sales 1998 over Q1],

CALCULATE([Sales])))

return

TargetTable

If condition checks if *Year = 1998*. If returns TRUE, sum for 1998 will be multiplied by 3 and I substract the sum which is after 1. April. Values for 1996 and 1997 will be just copied.

Once you have it, you can use it for a simple forecast.

Having a full year for 1998, can calculate CAGR. This is not very accurate with just 3 years of data.

## Comments