Knowledge (XXG)

NCSA Brown Dog

NCSA Brown Dog is a research project to develop a method for easily accessing stored historic research data, in order to maintain the long-term viability of large bodies of scientific research. It is supported by the National Center for Supercomputing Applications (NCSA), which is funded by the National Science Foundation (NSF).

History

Brown Dog is part of the DataNet partners program funded by NSF in 2008. DataNet was conceived to address the increasingly digital and data-intensive nature of science, engineering and education. Brown Dog is part of a follow-on effort called Data Infrastructure Building Blocks (DIBBs), focused on building software to support DataNet. The project was proposed by researchers at NCSA and the University of Illinois Urbana-Champaign as well as researchers from Boston University and the University of North Carolina at Chapel Hill.

Unstructured, uncurated, long tail data

Much scientific data is smaller, unstructured and uncurated and thus not easily shared. Such data is sometimes referred to as "long tail" data. This borrows a term from statistics and refers to the tail of the distribution of project sizes. The majority of smaller projects lack the resources to properly steward the data they produce. This so-called "long tail" data, both past and present, has the potential to inform future research in many study areas. Much of this data has become inaccessible due to obsolete software and file formats. The resulting impossibility of reviewing data from older research disrupts the overall scientific research project.

Approach

Brown Dog describes itself as the "super mutt" of software (thus the name "Brown Dog"), serving as a low-level data infrastructure to interface digital data content across the internet. Its approach is to use every possible source of automated help (i.e., software) in existence in a robust and provenance-preserving manner to create a service that can deal with as much of this data as possible. The project sees the broader impact of its work in its potential to serve the general public as a sort of "DNS for data", with the goal of making all data and all file formats as accessible as webpages are today.

Technology

Brown Dog seeks to address problems involving the use of uncurated and unstructured data collections through the development of two services: the Data Access Proxy (DAP), to aid in the conversion of file formats, and the Data Tilling Service (DTS), for the automatic extraction of metadata from file contents. Once developed, researchers and general public users will be able to download browser plugins and other tools from the Brown Dog tool catalog.

Data Tilling Service

Data Tilling Service (DTS) will allow users to search data collections using an existing file to discover other similar files in a collection. A DTS search field will be appended to configured browsers where example files can be dropped. This tells DTS to search all the files under a given URL for files similar to the dropped file. For example, while browsing an online image collection, a user could drop an image of three people into the search field, and the DTS would return all images in the collection that also contain three people. If DTS encounters a foreign file format, it will utilize DAP to make the file accessible. DTS also indexes the data and extracts and appends metadata to files and collections, enabling users to gain some sense of the type of data they are encountering.

This service runs on port 9443.

Data Access Proxy

Data Access Proxy (DAP) allows users to access data files that would otherwise be unreadable. Similar to an internet gateway or Domain Name Service, the DAP configuration would be entered into a user's machine and browser settings. Data requests over HTTP would first be examined by DAP to determine if the native file format is readable on the client device. If not, DAP converts the file into the best available format readable by the client machine. Alternatively, the user could specify the desired format themselves.

This service runs on port 8184.

Use cases

Brown Dog targets three use cases proposed by groups within the EarthCube research communities. Developers and researchers from these communities will work together on use cases that span geoscience, engineering, biology and social science.

Long tail vegetation data in ecology and global change biology

This use case is led by Michael Dietze, Boston University.

Data on the abundance, species composition, and size structure of vegetation is critically important for a wide array of sub-disciplines in ecology, conservation, natural resource management, and global change biology. However, addressing many of the pressing questions in these disciplines will require that terrestrial biosphere and hydrologic models are able to assimilate the large amount of long-tail data that exists but is largely inaccessible. The Brown Dog team, in cooperation with researchers from Dietze's lab, will facilitate the capture of a huge body of smaller research-oriented vegetation data sets collected over many decades and historical vegetation data embedded in Public Land Survey data dating back to 1785. This data will be used as initial conditions for models, to make sense of other large data sets, and for model calibration and validation.

Designing green infrastructure considering storm water and human requirements

This use case is led by Barbara Minsker, University of Illinois at Urbana-Champaign; William Sullivan, University of Illinois at Urbana-Champaign; Arthur Schmidt, University of Illinois at Urbana-Champaign.

This case study involves developing novel green infrastructure design criteria and models that integrate requirements for storm water management and ecosystem and human health and well-being. To address the scientific and social problems associated with the design of green spaces, data accessibility and availability is a major challenge. This study will focus on identified areas of the Green Healthy Neighborhood Planning region within the City of Chicago where existing local sewer performance is most deficient and where changes in impervious area through green infrastructure would be beneficial to underserved neighborhoods. Brown Dog will be used to extract long-tail experimental data on human landscape preferences and health impacts. This data will be used to develop a human health impacts model that will then be linked together with a terrestrial biosphere model and a storm water model using Brown Dog technology.

Development and application for critical zone studies

This use case is led by Praveen Kumar, University of Illinois at Urbana-Champaign.

The Critical Zone (CZ) is the "skin" of the earth that extends from the treetops to the bedrock and is created by life processes working at scales from microbes to biomes. The Critical Zone supports all terrestrial living systems. Its upper part is the bio-mantle. This is where terrestrial biota live, reproduce, use and expend energy, and where their wastes and remains accumulate and decompose. It encompasses the soil, which acts as a geomembrane through which water and solutes, energy, gases, solids, and organisms interact with the atmosphere, biosphere, hydrosphere, and lithosphere. A variety of drivers affect this bio-dynamic zone, ranging from climate and deforestation to agriculture, grazing and human development. Understanding and predicting these effects is central to managing and sustaining vital ecosystem services such as soil fertility, water purification, and production of food resources, and, at larger scales, global carbon cycling and carbon sequestration.

The CZ provides a unifying framework for integrating terrestrial surface and near-surface environments, and reflects an intricate web of biological and chemical processes and human impacts occurring at vastly different temporal and spatial scales. The nature of these data creates significant challenges for interdisciplinary studies of the CZ, because integrating the variety and number of data products and models has been a barrier. On the other hand, CZ data provides an excellent opportunity for defining, testing and implementing Brown Dog technologies. In this context, "unstructured" data is viewed broadly as consisting of a collection of heterogeneous data with formats that reflect temporal and disciplinary legacies, data from emerging low-cost, open-hardware-based sensors and embedded sensor networks that lack well-defined metadata and sensor characteristics, as well as data that are available as maps, images and text.

NSF Award

CIF21 DIBBs: Brown Dog was awarded in the winter of 2013 with a start date of October 1, 2013. Estimated expiration date is September 30, 2018. The award amount was $10,519,716.00, the largest DIBBs award. The principal investigator is Kenton McHenry of NCSA at the University of Illinois at Urbana-Champaign. Co-leaders are Jong Lee, NCSA/UIUC; Barbara Minsker, Civil and Environmental Engineering, University of Illinois at Urbana-Champaign; Praveen Kumar, Civil and Environmental Engineering, University of Illinois at Urbana-Champaign; Michael Dietze, Department of Earth and Environment, Boston University.
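The format decision attributed to the Data Access Proxy (deliver the native format if the client can read it, otherwise convert to the best readable format, unless the user names a format) can be illustrated with a short Python sketch. This is not Brown Dog code: the function name, its parameters, and the rule that "best available" means the client's most preferred format are all assumptions made for illustration.

```python
from typing import Optional, Sequence, Set


def choose_delivery_format(
    native_format: str,
    client_formats: Sequence[str],
    conversion_targets: Set[str],
    requested_format: Optional[str] = None,
) -> Optional[str]:
    """Pick the format in which a file would be handed to the client.

    Hypothetical helper, not part of any Brown Dog API.
    client_formats is assumed to be ordered from most to least preferred.
    Returns None when no readable format can be produced.
    """
    # Formats the proxy could deliver: the native one plus any conversion target.
    deliverable = {native_format} | set(conversion_targets)

    # An explicit user choice takes priority when it can be satisfied.
    if requested_format is not None:
        return requested_format if requested_format in deliverable else None

    # No conversion is needed if the client already reads the native format.
    if native_format in client_formats:
        return native_format

    # Otherwise take the client's most preferred format that can be produced.
    for fmt in client_formats:
        if fmt in conversion_targets:
            return fmt
    return None
```

For instance, with a native format of "wpd", a client that reads "pdf" and "txt" (in that order of preference), and conversions available to both, the sketch selects "pdf". Treating the client's ordering as the measure of "best" is only one possible reading of the description; a real proxy might instead rank formats by how much information each conversion preserves.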

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.