Knowledge (XXG)

SEMMA

Source 📝

137:
SEMMA mainly focuses on the modeling tasks of data mining projects, leaving the business aspects out (unlike, e.g., CRISP-DM and its Business Understanding phase). Additionally, SEMMA is designed to help the users of the SAS Enterprise Miner software. Therefore, applying it outside Enterprise Miner
138:
may be ambiguous. However, in order to complete the "Sampling" phase of SEMMA a deep understanding of the business aspects would have to be a requirement in order to do effective sampling. So, in effect, a business understanding would be required to effectively complete sampling.
60:
applications. Although SEMMA is often considered to be a general data mining methodology, SAS claims that it is "rather a logical organization of the functional tool set of" one of their products, SAS Enterprise Miner, "for carrying out the core tasks of data mining".
100:, e.g., selecting the data set for modeling. The data set should be large enough to contain sufficient information to retrieve, yet small enough to be used efficiently. This phase also deals with data partitioning. 78: 122:. In the Model phase the focus is on applying various modeling (data mining) techniques on the prepared variables in order to create models that possibly provide the desired outcome. 106:. This phase covers the understanding of the data by discovering anticipated and unanticipated relationships between the variables, and also abnormalities, with the help of 192: 147: 74: 167: 208: 73:
for the diversified and iterative process of data mining that users can apply to their data mining projects regardless of industry. While the
171: 128:. The last phase is Assess. The evaluation of the modeling results shows the reliability and usefulness of the created models. 189: 242: 81:
initiative, aimed to create a neutral methodology, SAS also offered a pattern to follow in its data mining tools.
116:. The Modify phase contains methods to select, create and transform variables in preparation for data modeling. 69:
In the expanding field of data mining, there has been a call for a standard methodology or a simple list of
97: 53: 224: 107: 196: 175: 70: 236: 45: 170:. In Proceedings of the IADIS European Conference on Data Mining 2008, pp 182-185. 57: 209:
A Proposed Data Mining Methodology and its Application to Industrial Procedures
49: 226:
KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW, Ana Azevedo and M.F. Santos
79:
European Strategic Program on Research in Information Technology
89:
The phases of SEMMA and related tasks are the following:
44:. It is a list of sequential steps developed by 148:Cross Industry Standard Process for Data Mining 75:Cross Industry Standard Process for Data Mining 8: 168:KDD, SEMMA and CRISP-DM: a parallel overview 56:software. It guides the implementation of 185: 183: 207:Rohanizadeh, S. S. and Moghadam, M. B. 159: 7: 211:Journal of Industrial Engineering 48:, one of the largest producers of 14: 24:is an acronym that stands for 166:Azevedo, A. and Santos, M. F. 1: 190:SAS Enterprise Miner website 77:or CRISP-DM, founded by the 259: 96:. The process starts with 174:January 9, 2013, at the 195:March 8, 2012, at the 54:business intelligence 243:Applied data mining 16:Data mining process 108:data visualization 250: 227: 222: 216: 215:(2009) pp 37-50. 205: 199: 187: 178: 164: 258: 257: 253: 252: 251: 249: 248: 247: 233: 232: 231: 230: 223: 219: 206: 202: 197:Wayback Machine 188: 181: 176:Wayback Machine 165: 161: 156: 144: 135: 87: 85:Phases of SEMMA 67: 17: 12: 11: 5: 256: 254: 246: 245: 235: 234: 229: 228: 217: 200: 179: 158: 157: 155: 152: 151: 150: 143: 140: 134: 131: 130: 129: 123: 117: 111: 101: 86: 83: 71:best practices 66: 63: 15: 13: 10: 9: 6: 4: 3: 2: 255: 244: 241: 240: 238: 225: 221: 218: 214: 210: 204: 201: 198: 194: 191: 186: 184: 180: 177: 173: 169: 163: 160: 153: 149: 146: 145: 141: 139: 132: 127: 124: 121: 118: 115: 112: 109: 105: 102: 99: 98:data sampling 95: 92: 91: 90: 84: 82: 80: 76: 72: 64: 62: 59: 55: 51: 47: 46:SAS Institute 43: 39: 35: 31: 27: 23: 22: 220: 212: 203: 162: 136: 125: 119: 113: 103: 93: 88: 68: 41: 37: 33: 29: 25: 20: 19: 18: 58:data mining 154:References 65:Background 50:statistics 133:Criticism 237:Category 193:Archived 172:Archived 142:See also 104:Explore 30:Explore 126:Assess 114:Modify 94:Sample 42:Assess 40:, and 34:Modify 26:Sample 120:Model 38:Model 21:SEMMA 52:and 239:: 182:^ 36:, 32:, 28:, 213:4 110:.

Index

SAS Institute
statistics
business intelligence
data mining
best practices
Cross Industry Standard Process for Data Mining
European Strategic Program on Research in Information Technology
data sampling
data visualization
Cross Industry Standard Process for Data Mining
KDD, SEMMA and CRISP-DM: a parallel overview
Archived
Wayback Machine


SAS Enterprise Miner website
Archived
Wayback Machine
A Proposed Data Mining Methodology and its Application to Industrial Procedures

Category
Applied data mining

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.