Text Data Mining
Parse and structure words and phrases within texts in order to identify patterns, trends, topics, similarities, and more.
see projects

Data Visualization
Represent data in a symbolic visual environment in order to help explore and understand datasets (interactive visualization) or to convey a story or point of view (infographic).
see projects

Geospatial Analysis
Use Geographic Information Systems (GIS) tools to question, analyze, and interpret spatial (location-based) and non-spatial (tabular) data to understand relationships, patterns, and trends.
see projects

The mission of the Baylor Libraries’ Digital Scholarship Program is to integrate digital research tools and techniques into campus research and curricula.

Most recent 25 posts

  • Data & Digital Scholarship Workshops Fall 2022
    Interested in the Data Sciences and the Digital Humanities? Here are our in-person workshops scheduled for Fall 2022. From text data mining to Python scripting to geospatial analysis and 3D printing, hopefully there is something for everyone!
  • Identify Indirect Voice Sentences - Spanish, Latin
    What does this tool do? Identify sentences in uploaded documents that use the indirect voice. Currently this tool supports Spanish and Latin. How does the tool do this? Spanish: Looks for variations of decir with indirect pronouns to classify as…
  • Image identification using the Binary Robust Invariant Scalable Keypoints (BRISK) Method
    What does this tool do? Finds images in video files. You provide image(s) and video(s) and this tool will create an Excel file visually showing the potential matches, the confidence of the match, and the timestamp in the video. How…
  • List Frequencies in Documents
    What does this tool do? Count keywords/keyphrases from a corpus of documents. There are two ways you can provide a list or multiple lists of keywords. Enter the keywords in the Terms textbox, separated by commas. Upload an Excel workbook…
  • Writing Style Similarity: Calculate Probability of Authorship Based on an Implementation of Stylometry
    What does this tool do? Calculates the probability of a document’s authorship based on writing styles. This field of analysis is called stylometry. (Stylometry - Wikipedia) How does the tool do this? This tool uses the Burrows’s Delta Method to…
  • Accessing NodeXL on Microsoft Azure for Mac Users
    Steps for Mac Users Using NodeXL: These steps will install NodeXL on your instance of the Virtual Desktop (VDI). You only need to follow these steps once. You will need permission to access the virtual desktop. If you have not yet done so, make sure to send an email to helpdesk@baylor.edu from your Baylor…
  • Deep Learning to Identify Human Settlements from Landsat Imagery
    Need to identify human settlement areas from Landsat satellite images? Follow these steps using ArcGIS Pro. "Human settlements maps are useful in understanding growth patterns, population distribution, resource management, change detection, and a variety of other applications where information related to earth surface is required. Human settlements classification is a complex exercise and is hard…
  • Congratulations Data Scholars 2020-21 and 2021-22!
    Come celebrate the incredible 150+ Baylor researchers who completed the requirements of the Library’s Data Scholar Program in the last two years. These researchers completed Library-created on-demand modules in data visualization, text data mining, Python scripting, research data management, and working with…
  • Data Research Fellows 2021
    Fundamentals of Data Research (FDR) Fellows 2021. Come join us on September 10 for this year’s Data Research Fellow presentations! Managed by the Library’s Data & Digital Scholarship program, fifteen fellows (ten faculty, five graduate students) learned how to integrate data science methods into their research.…
  • Recogito Tagging to spaCy Trained Model
    https://colab.research.google.com/drive/1xB0MVhC1vTvXdlM5iNym_BLCUzU_rE7X?usp=sharing Problem: Researchers who manually tag text content using Recogito may want to train a named entity recognition model based on their manual tags. For example, a researcher may collect documents using Gale's Digital Scholar Lab and want to train their own named entity recognition model.…
  • Mapping the Attention of Waco-Area Residents From 1916-1918
    “Mapping the Geographic Attention of Waco-Area Residents from 1916-1918”: presentation given as part of the Texas Map Society Spring 2021 Meeting. To understand which parts of the world attracted the attention of Waco-area residents leading up to the U.S. involvement in WWI, this research examines…
  • Butane: Baylor University Transcript Analytics & Exploration Tool
    Provided by Baylor University. Do you have interview transcripts in Word or PDF format? Want to mine and visualize these transcripts for word frequencies, parts of speech, named entities, sentiment, topics, and words…
  • Creations: Celebrating Transformational Research & Scholarship
    View Power BI Dashboard & Message from President Linda Livingstone. Dashboard contains Creations content from 2018-2020. Our annual Creations exhibition supports the research ambitions articulated in Baylor's academic strategic plan, Illuminate, and…
  • Intensive Training: Learn to Text Data Mine Using Jupyter Notebooks on Google Colab
    Materials to accompany the ER&L 2021 workshop 'Learn to Text Data Mine Using Jupyter Notebooks on Google Colab'. Presenter: Joshua Been, Baylor University Libraries. Materials: View Workshop Materials on GitHub. About: This asynchronous and self-paced workshop is organized into 4 sections and 7…
  • Webinar: Hands-On Tableau Desktop: Mapping the 2016 Presidential Election
    Date/Time: January 12, 11am. Title: Hands-On Tableau Desktop: Mapping the 2016 Presidential Election. Description: Participants in this introductory webinar will gain hands-on experience generating interactive maps and map-centric dashboards using Tableau Desktop. The webinar will focus on 2016 Presidential election results by…
  • Spring 2021: Data Scholar Workshop Modules
    (1) Take Workshops, (2) Pass Quizzes, (3) Become a Data Scholar. What is the Data Scholar Program? The Baylor University Library’s Data Scholar Program is a collection of self-paced data & digital scholarship video modules designed specifically to meet the needs of the Baylor research community. Modules are offered in the following 5 categories: (1)…
  • Data Viz of the Week #15: Baylor Biblical Art
    Published: Sunday, December 06, 2020. Title: Baylor Biblical Art. Contributors: Beth Farwell, Joshua Been. Data: St. John's Bible. Data Visualization Software: Microsoft Power BI.
  • Data Viz of the Week #14: Baylor University COVID-19 Dashboard
    Published: Wednesday, November 25, 2020. Title: Baylor University COVID-19 Dashboard. Contributors: Joshua Been. Data: Baylor University. Data Visualization Software: Microsoft Power BI.
  • Jupyter Notebook: NCapture Twitter Word Frequencies
    Access Jupyter Notebook on Google Colab here: https://colab.research.google.com/drive/1a-U8Bx1KvPrLaLNkM3eMJyJQ7183IchE?usp=sharing Problem: NVivo sets all NCapture record columns from Twitter as Classifying and only the Tweet itself as Codable. For researchers wanting word frequencies and word clouds from a different field, such as the hashtags, the optimal method is…
  • Data Viz of the Week #13: World's 100 Tallest Buildings
    Published: Monday, November 15, 2020. Title: World's 100 Tallest Buildings. Contributors: Ken Carriveau, Joshua Been. Data: The Skyscraper Center. Data Visualization Software: Microsoft Power BI.
  • Data Viz of the Week #12: Top 15 Bachelor's Degree Categories Awarded by Baylor University 2012-2017
    Published: Monday, November 09, 2020. Title: Top 15 Bachelor's Degree Categories Awarded by Baylor University 2012-2017. Contributors: Ellen Filgo, Joshua Been. Data: IPEDS (Integrated Postsecondary Education Data System).
  • Data Viz of the Week #11: Early Voting by County by Date 2020
    Published: Monday, November 02, 2020. Title: Early Voting by County by Date 2020. Contributors: Joshua Been. Data: Texas Secretary of State.
  • Data Viz of the Week #10: What are the Favorite Halloween Candies in Your State?
    Published: Sunday, November 01, 2020. Title: What are the Favorite Halloween Candies in Your State? Contributors: Carol Schuetz, Joshua Been. Data: https://www.candystore.com/
  • Data Viz of the Week #9: Analyzing the Use of 'Anti-Lynching' using HTRC+Bookworm
    Data Visualization with the HTRC’s Bookworm Tool, by Eileen Bentsen, Librarian - English, History, Honors College, & Medical Humanities. On October 26, 1921, President Harding gave a speech in Alabama condemning lynching. While the speech, by today’s standards, would be considered far short of the mark of civil rights (Pres. Harding sought only political and…
  • Data Viz of the Week #8: Altmetrics for the Creative and Performing Arts
    Published: Monday, October 12, 2020. Title: Altmetrics for the Creative and Performing Arts. Contributors: Christina Chan-Park, Sha Towers, Clayton Crenshaw, Joshua Been. Data: Data collected in Spring 2020 by Christina Chan-Park, Clayton Crenshaw, and Sha Towers for the project “Identifying a Possible Suite of (alt)metrics for Creative and…