I have list of order numbers, each containing ordered SKU's(One order may have multiple SKU's) and their ordered Quantities. Now after initial analysis I found that 16/550 SKU;s are there in 50% Orders. Also 5 or more of these 16 SKU's are there in 20% orders. Question: I want to identify top 15-20 SKU's which are present in most of Orders in significant volume(High Quantity). I heard about cluster analysis can cluster these high perfroming SKU's. Can anyone help how to find these high performig SKU's? This concept is known as forward area in warehousing. Thanks Source: Sample Raw data Material Code, Delivery Number, Quantity ordered
Thanks