Knowledge

Talk:Distance matrix

Source 📝

133: 77: 53: 22: 378:
topics). Given such a dicotomy of approaches to the topic, we should not be making a choice here but rather attempting to describe both views. I reverted your recent edits so as not to lose this other viewpoint, but the article certainly needs work. Finding some reliable sources for each viewpoint would be a great start.
258:
From a mathematical perspective, if it's not symmetric, it's not really a distance matrix. Intuitively, distance implies the distance from a to b should be equivalent to the distance from b to a. If this doesn't hold, I don't think it should be called a distance matrix. This property is not exclusive
293:
Can you think of applications that use the term loosely? In my experience, distance implies some structure in the matrix that can then be used by algorithms for clustering or tree building. If a matrix doesn't actually have any of this structure, I don't think the term distance should be used. Maybe
274:
This article is a bit of a mess. The opening sentence clearly talks about distance as a metric, but the applications in bioinformatics and related fields use distance with a looser non-metric meaning (and while they even talk about metrics, these are not the mathematically defined terms). In these
361:
It makes the claim that the matrix need not be symmetric and that it need not be hollow. The rules 2 and 3 under formalization directly contradict this. They state precisely in no uncertain terms that the matrix must be both symmetric and hollow. It also seems to want to allow for complex valued
377:
I believe that you made the wrong choice here. The "Formalization" section was only added recently, and, as you point out, contradicted material that was already on the page for a long while. The application areas seem to use this concept with a non-metric distance in mind (look at the See also
418:
We currently have the applications of a "distance matrix" spread in sections "Bioinformatics", "Data Mining and Machine Learning", "Information retrieval", "Chemistry", and "Other Applications". Some of these about specific definitions of distances, some talk about specific algorithm run on a
422:
I believe we should talk keep field-specific definition of distances inside the "field" sections, but move the algorithms out to a more general section. Right now we are basically mentioning clustering algorithms twice, once as phylogeny, once as data-mining.
173:
with the merger. Adjacency matrices are simply a way to tell whether two vertices are connected or not, and by how many paths. Distance matrices give the distances between two vertices, not whether they are connected or not.
142: 123: 63: 363: 533: 147: 523: 362:
metrics which is not in line with the standard definition of a metric. As it stands it is a complete mess. The article should at least be self consistent even if it is inaccurate.--
117: 275:
applications, distances can be negative and don't have to be symmetric. I could help fixing up the mathematical side of this topic, but the other applications are beyond my ken.
538: 528: 312:
While searching for something more explicit to give you I stumbled upon the shortest path problem in networks which is a clear example. I've amended the article accordingly.
543: 396:
To be fair, I removed it, but after reading your comment, I agree that covering both points of view is the best way forward, at least until we uncover more sources.
518: 93: 208:
Too bad no one of you added any information about adjacency matrices to the article - at least that should have been done before removing the tags... --
294:
in the math definition non-negativity is too restrictive because transformations can usually resolve this, but I think asymmetry should at least hold.
84: 58: 198:
with the merger. However, these topics are somewhat related and there should be a sentence or two describing their relationship. --
367: 254:
Unlike a Euclidean distance matrix, the matrix does not need to be symmetric—that is, the values xi,j do not necessarily equal xj,i.
500: 239: 132: 33: 21: 228: 357:
The section "Comparison with Euclidean distance matrix" contradicts the "Formalization" section completely
186: 39: 333:
Becker, R. A., Chambers, J. M. and Wilks, A. R. (1988) The New S Language. Wadsworth & Brooks/Cole.
224: 401: 347: 299: 264: 496: 339:
Borg, I. and Groenen, P. (1997) Modern Multidimensional Scaling. Theory and Applications. Springer.
92:
on Knowledge. If you would like to participate, please visit the project page, where you can join
397: 383: 343: 317: 295: 280: 260: 89: 182: 493: 76: 52: 336:
Mardia, K. V., Kent, J. T. and Bibby, J. M. (1979) Multivariate Analysis. Academic Press.
223:
with the merger. This would leave out the important Mahalanobis distance, which see.
512: 379: 313: 276: 199: 209: 504: 405: 387: 371: 351: 321: 303: 284: 268: 232: 212: 202: 189: 245:
These topic are closely related. Could we merged them in some way?
485:
Basically everything here are definitions of distance functions.
426:
My current preferred outline to rewrite into would look like:
15: 131: 88:, a collaborative effort to improve the coverage of 472:Gaussian mixture distance, defined over documents 122:This article has not yet received a rating on the 252: 534:High-importance Computational Biology articles 524:Unknown-importance Molecular Biology articles 461:Among sequences: alignment and alignment-free 8: 19: 539:WikiProject Computational Biology articles 529:Start-Class Computational Biology articles 47: 544:All WikiProject Molecular Biology pages 419:distance matrix, some talk about both. 102:Knowledge:WikiProject Molecular Biology 49: 519:Start-Class Molecular Biology articles 105:Template:WikiProject Molecular Biology 436:Hierarchical clustering and phylogeny 7: 364:2003:69:CD3F:B01:2876:80EC:DC6E:7C68 249:Distance matrix should be symmetric. 143:the Computational Biology task force 82:This article is within the scope of 38:It is of interest to the following 14: 475:Comparison with cosine similarity 455:Definition of distance functions 447:Neighborhood Retrieval Visualizer 430:Algorithms on a distance matrix 185:. These are different concepts. 75: 51: 20: 1: 203:13:54, 23 February 2006 (UTC) 190:13:58, 21 February 2006 (UTC) 181:with the propsed merger with 140:This article is supported by 96:and see a list of open tasks. 85:WikiProject Molecular Biology 406:14:56, 22 October 2015 (UTC) 388:18:42, 20 October 2015 (UTC) 372:12:19, 20 October 2015 (UTC) 352:18:20, 13 October 2015 (UTC) 322:02:46, 21 October 2015 (UTC) 304:16:36, 17 October 2015 (UTC) 285:05:30, 14 October 2015 (UTC) 269:17:59, 13 October 2015 (UTC) 505:08:05, 4 January 2024 (UTC) 259:to Euclidean space either. 233:08:13, 12 August 2008 (UTC) 560: 414:Restructure "applications" 213:18:03, 30 April 2006 (UTC) 108:Molecular Biology articles 139: 121: 70: 46: 328:References to check out 240:Distance matrix methods 469:Information retrieval 256: 136: 28:This article is rated 135: 450:Dynamic time warping 441:K-nearest neighbors 137: 34:content assessment 503: 162: 161: 158: 157: 154: 153: 99:Molecular Biology 90:Molecular Biology 59:Molecular Biology 551: 499: 464:Among 3D volumes 380:Bill Cherowitzo 314:Bill Cherowitzo 277:Bill Cherowitzo 187:Charles Matthews 183:adjacency matrix 124:importance scale 110: 109: 106: 103: 100: 79: 72: 71: 66: 55: 48: 31: 25: 24: 16: 559: 558: 554: 553: 552: 550: 549: 548: 509: 508: 458:Bioinformatics 416: 359: 330: 251: 243: 167: 148:High-importance 107: 104: 101: 98: 97: 61: 32:on Knowledge's 29: 12: 11: 5: 557: 555: 547: 546: 541: 536: 531: 526: 521: 511: 510: 491: 490: 489: 488: 487: 486: 480: 479: 478: 477: 476: 467: 466: 465: 462: 453: 452: 451: 448: 445: 442: 439: 438: 437: 415: 412: 411: 410: 409: 408: 391: 390: 358: 355: 341: 340: 337: 334: 329: 326: 325: 324: 309: 308: 307: 306: 288: 287: 250: 247: 242: 236: 218: 216: 215: 166: 163: 160: 159: 156: 155: 152: 151: 138: 128: 127: 120: 114: 113: 111: 94:the discussion 80: 68: 67: 56: 44: 43: 37: 26: 13: 10: 9: 6: 4: 3: 2: 556: 545: 542: 540: 537: 535: 532: 530: 527: 525: 522: 520: 517: 516: 514: 507: 506: 502: 498: 495: 484: 483: 481: 474: 473: 471: 470: 468: 463: 460: 459: 457: 456: 454: 449: 446: 443: 440: 435: 434: 432: 431: 429: 428: 427: 424: 420: 413: 407: 403: 399: 395: 394: 393: 392: 389: 385: 381: 376: 375: 374: 373: 369: 365: 356: 354: 353: 349: 345: 338: 335: 332: 331: 327: 323: 319: 315: 311: 310: 305: 301: 297: 292: 291: 290: 289: 286: 282: 278: 273: 272: 271: 270: 266: 262: 255: 248: 246: 241: 238:Merging with 237: 235: 234: 230: 226: 222: 214: 211: 207: 206: 205: 204: 201: 197: 192: 191: 188: 184: 180: 175: 172: 164: 149: 146:(assessed as 145: 144: 134: 130: 129: 125: 119: 116: 115: 112: 95: 91: 87: 86: 81: 78: 74: 73: 69: 65: 60: 57: 54: 50: 45: 41: 35: 27: 23: 18: 17: 492: 425: 421: 417: 360: 342: 257: 253: 244: 225:Pmanleycooke 220: 217: 195: 193: 178: 176: 170: 168: 141: 83: 40:WikiProjects 433:Clustering 30:Start-class 513:Categories 482:Chemistry 221:disagree 200:BAxelrod 196:disagree 179:disagree 171:disagree 165:Untitled 494:Artoria 219:I also 194:I also 64:COMPBIO 444:Isomap 398:Djh901 344:Djh901 296:Djh901 261:Djh901 210:Abdull 36:scale. 402:talk 384:talk 368:talk 348:talk 318:talk 300:talk 281:talk 265:talk 229:talk 497:2e5 118:??? 515:: 501:🌉 404:) 386:) 370:) 350:) 320:) 302:) 283:) 267:) 231:) 177:I 169:I 150:). 62:: 400:( 382:( 366:( 346:( 316:( 298:( 279:( 263:( 227:( 126:. 42::

Index


content assessment
WikiProjects
WikiProject icon
Molecular Biology
COMPBIO
WikiProject icon
WikiProject Molecular Biology
Molecular Biology
the discussion
???
importance scale
Taskforce icon
the Computational Biology task force
High-importance
adjacency matrix
Charles Matthews
13:58, 21 February 2006 (UTC)
BAxelrod
13:54, 23 February 2006 (UTC)
Abdull
18:03, 30 April 2006 (UTC)
Pmanleycooke
talk
08:13, 12 August 2008 (UTC)
Distance matrix methods
Djh901
talk
17:59, 13 October 2015 (UTC)
Bill Cherowitzo

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.