Skip to Main Content
University of Texas University of Texas Libraries

DS Pedagogy Institute: AI Tools and Class Intergration

This guide is for the Digital Scholarship Pedagogy Institute, providing an overview of using various AI tools for reading and annotating articles, as well as creating visualizations and conducting text analysis.

Microsoft Copilot

What is Microsoft Copilot?

Microsoft Copilot is a generative artificial intelligence chatbot developed by Microsoft, similar to ChatGPT. Copilot utilizes the ChatGPT model 4 for free and DALL·E-3, an image generator. Copilot is a generative AI model that gathers data from online resources and trains itself based on that data. Its goal is to answer questions based on user input.

Microsoft Copilot vs. ChatGPT

Microsoft Copilot

Microsoft Copilot can generate text and images. It offers three conversation settings: more creativemore balanced, and more precise. Additionally, it prioritizes privacy protection by not selling data to external companies or using user conversations to train ChatGPT 4 or DALL-E.

However, please note that Co-pilot retains your conversation history for internal training purposes. To delete your conversation history follow Microsoft's instructions on deleting interaction history

To gain a better understanding of Microsoft’s policies and security, I recommend reading their Privacy and Security FAQ. 

ChatGPT 4

ChatGPT 4 is a free generative AI tool that leverages online data to train itself. The data you provide during conversations contributes to its self-training.

Using Microsoft Copilot for Simple Text Mining

Text mining is a computational method for analyzing and extracting data from a corpus. It is essential for researchers because it allows automatic recognition and extraction of data from small or large corpora, which can then be further analyzed. Text mining assists in examining themes, creating connections, and posing questions.

Microsoft Copilot is not intended to replace tools that automatically perform text mining. Instead, it can support the process as a complementary tool or to conduct quick anaylsis on a text. 


Optical Character Recognition

OCR technology converts images or scans into machine-readable text. If you’re working with materials like newspapers, articles, or handwritten text, it’s essential to OCR them before conducting text mining.

With Microsoft Copilot, you can upload images that contain articles without columns, provided they are in PNG format. It works well with both English and Spanish text.

Practice Text


Translation

When dealing with non-English text, translating the document is necessary in some research. While there is an immense amount of paid software, Microsoft Copilot can provide a draft version of free translation for simple texts

El Despertador Practice Text


Name Entity Recognition

NER identifies names, places, events, and dates within a document. It tags and highlights this information, allowing researchers to understand trends, themes, and events in the article.

Microsoft Copilot provides you with a list of names, places, events, and dates, which helps provide a quick overview of the text before conducting a closer analysis.

El Despertado Practice Text


N-grams

N-grams are chunks of words extracted from a text. These chunks often consist of two common words or phrases. For example:

  • Los Angeles
  • Jane Austen
  • I like cooking

 The purpose of using N-grams is to predict words in a text and provide cultural context.

Microsoft Copilot can recognize common chunks and provide a list of those chunks.

Great Gatsby Practice Text

 

Activity

Select one or all of the articles from La Guardia, a Milwaukee-based newspaper, and conduct Named Entity Recognition and N-grams analysis using Microsoft Copilot.

  • Who or what are the key entities mentioned in the text?
  • What words or phrases are recurring in the text, and what information do they provide about the text?
  • If you ran all three texts through Microsoft Copilot, what would be La Guardia's main theme?

Now that we have pre-read the text using Microsoft Copilot, read the text.

  • Did the pre-reading activity provide some context before reading the text?
  • Did it help you further understand the main characters/people and themes occurring

Articles: 

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 Generic License.