More stories

  • in

    A comprehensive study of technological change

    The societal impacts of technological change can be seen in many domains, from messenger RNA vaccines and automation to drones and climate change. The pace of that technological change can affect its impact, and how quickly a technology improves in performance can be an indicator of its future importance. For decision-makers like investors, entrepreneurs, and policymakers, predicting which technologies are fast improving (and which are overhyped) can mean the difference between success and failure.

    New research from MIT aims to assist in the prediction of technology performance improvement using U.S. patents as a dataset. The study describes 97 percent of the U.S. patent system as a set of 1,757 discrete technology domains, and quantitatively assesses each domain for its improvement potential.

    “The rate of improvement can only be empirically estimated when substantial performance measurements are made over long time periods,” says Anuraag Singh SM ’20, lead author of the paper. “In some large technological fields, including software and clinical medicine, such measures have rarely, if ever, been made.”

    A previous MIT study provided empirical measures for 30 technological domains, but the patent sets identified for those technologies cover less than 15 percent of the patents in the U.S. patent system. The major purpose of this new study is to provide predictions of the performance improvement rates for the thousands of domains not accessed by empirical measurement. To accomplish this, the researchers developed a method using a new probability-based algorithm, machine learning, natural language processing, and patent network analytics.

    Overlap and centrality

    A technology domain, as the researchers define it, consists of sets of artifacts fulfilling a specific function using a specific branch of scientific knowledge. To find the patents that best represent a domain, the team built on previous research conducted by co-author Chris Magee, a professor of the practice of engineering systems within the Institute for Data, Systems, and Society (IDSS). Magee and his colleagues found that by looking for patent overlap between the U.S. and international patent-classification systems, they could quickly identify patents that best represent a technology. The researchers ultimately created a correspondence of all patents within the U.S. patent system to a set of 1,757 technology domains.

    To estimate performance improvement, Singh employed a method refined by co-authors Magee and Giorgio Triulzi, a researcher with the Sociotechnical Systems Research Center (SSRC) within IDSS and an assistant professor at Universidad de los Andes in Colombia. Their method is based on the average “centrality” of patents in the patent citation network. Centrality refers to multiple criteria for determining the ranking or importance of nodes within a network.

    “Our method provides predictions of performance improvement rates for nearly all definable technologies for the first time,” says Singh.

    Those rates vary — from a low of 2 percent per year for the “Mechanical skin treatment — Hair removal and wrinkles” domain to a high of 216 percent per year for the “Dynamic information exchange and support systems integrating multiple channels” domain. The researchers found that most technologies improve slowly; more than 80 percent of technologies improve at less than 25 percent per year. Notably, the number of patents in a technological area was not a strong indicator of a higher improvement rate.

    “Fast-improving domains are concentrated in a few technological areas,” says Magee. “The domains that show improvement rates greater than the predicted rate for integrated chips — 42 percent, from Moore’s law — are predominantly based upon software and algorithms.”

    TechNext Inc.

    The researchers built an online interactive system where domains corresponding to technology-related keywords can be found along with their improvement rates. Users can input a keyword describing a technology and the system returns a prediction of improvement for the technological domain, an automated measure of the quality of the match between the keyword and the domain, and patent sets so that the reader can judge the semantic quality of the match.

    Moving forward, the researchers have founded a new MIT spinoff called TechNext Inc. to further refine this technology and use it to help leaders make better decisions, from budgets to investment priorities to technology policy. Like any inventors, Magee and his colleagues want to protect their intellectual property rights. To that end, they have applied for a patent for their novel system and its unique methodology.

    “Technologies that improve faster win the market,” says Singh. “Our search system enables technology managers, investors, policymakers, and entrepreneurs to quickly look up predictions of improvement rates for specific technologies.”

    Adds Magee: “Our goal is to bring greater accuracy, precision, and repeatability to the as-yet fuzzy art of technology forecasting.” More

  • in

    Study finds lockdowns effective at reducing travel in Sierra Leone

    Throughout the Covid-19 pandemic, governments have used data on people’s movements to inform strategies for containing the spread of the virus. In Europe and the United States, for example, contact-tracing apps have used Bluetooth signals in smartphones to alert people when they’ve spent time near app users who have tested positive for Covid-19. 

    But how can governments make evidence-based decisions in countries where such fine-grained data isn’t available? In recent findings, MIT researchers, in collaboration with Sierra Leone’s government, use cell tower records in Sierra Leone to show that people were traveling less during lockdowns. “When the government implemented novel three-day lockdowns, there was a dual aim to reduce virus spread and also limit social impacts, like increased hunger or food insecurity,” says Professor Lily L. Tsai, MIT Governance Lab’s (MIT GOV/LAB) director and founder. “We wanted to know if shorter lockdowns would be successful.”   

    The research was conducted by MIT GOV/LAB and MIT’s Civic Data Design Lab (CDDL), in partnership with Sierra Leone’s Directorate for Science, Innovation and Technology (DSTI) and Africell, a wireless service provider. The findings will be published as a chapter in the book “Urban Informatics and Future Cities,” a selection of research submitted to the 2021 Computational Urban Planning and Urban Management conference. 

    A proxy for mobility: cell tower records

    Any time someone’s cellphone sends or receives a text, or makes or receives a call, the nearest cell tower is pinged. The tower collects some data (call-detail records, or CDRs), including the date and time of the event and the phone number. By tracking which towers a certain (anonymized) phone number pings, the researchers could approximately measure how much someone was moving around.  

    These measurements showed that, on average, people were traveling less during lockdowns than before lockdowns. Professor Sarah Williams, CDDL’s director, says the analysis also revealed frequently traveled routes, which “allow the government to develop region-specific lockdowns.” 

    While more fine-grained GPS data from smartphones paint a more accurate picture of movement, “there just isn’t a systematic effort in many developing countries to build the infrastructure to collect this data,” says Innocent Ndubuisi-Obi Jr., an MIT GOV/LAB research associate. “In many cases, the closest thing we can use as a proxy for mobility is CDR data.”

    Measuring the effectiveness of lockdowns

    Sierra Leone’s government imposed the three-day lockdown, which required people stay in their homes, in April 2020. A few days after the lockdown ended, a two-week inter-district travel ban began. “Analysis of aggregated CDRs was the quickest means to understanding mobility prior to and during lockdowns,” says Michala Mackay, DSTI’s director and chief operating officer. 

    The data MIT and DSTI received was anonymized — an essential part of ensuring the privacy of the individuals whose data was used. 

    Extracting meaning from the data, though, presented some challenges. Only about 75 percent of adults in Sierra Leone own cellphones, and people sometimes share phones. So the towers pinged by a specific phone might actually represent the movement of several people, and not everyone’s movement will be captured by cell towers. 

    Furthermore, some districts in Sierra Leone have significantly fewer towers than others. When the data were collected, Falaba, a rural district in the northeast, had only five towers, while over 100 towers were clustered in and around Freetown, the capital. In areas with very few towers, it’s harder to detect changes in how much people are traveling. 

    Since each district had a unique tower distribution, the researchers looked at each district separately, establishing a baseline for average distance traveled in each district before the lockdowns, then measuring how movement compared to this average during lockdowns. They found that travel to other districts declined in every district, by as much as 72 percent and by as little as 16 percent. Travel within districts also dropped in all but one district. 

    This map shows change in average distance traveled per trip to other districts in Sierra Leone in 2020.

    Image courtesy of the MIT GOV/LAB and CDDL.

    Previous item
    Next item

    Lockdowns have greater costs in poorer areas

    While movement did decline in all districts, the effect was less dramatic in poorer, more sparsely populated areas. This finding was to be expected; other studies have shown that poorer people often can’t afford to comply with lockdowns, since they can’t take time off work or need to travel to get food. Evidence showing how lockdowns are less effective in poorer areas highlights the importance of distributing resources to poorer areas during crises, which could both provide support during a particularly challenging time and make it less costly for people to comply with social distancing measures. 

    “In low-income communities that demonstrated moderate or low compliance, one of the most common reasons why people left their homes was to search for water,” says Mackay. “A policy takeaway was that lockdowns should only be implemented in extreme cases and for no longer than three days at a time.”

    Throughout the project, the researchers collaborated intimately with DSTI. “This meant government officials learned along with the MIT researchers and added crucial local knowledge,” says Williams. “We hope this model can be replicated elsewhere — especially during crises.” 

    The researchers will be developing an MITx course teaching government officials and MIT students how to collaboratively use CDR data during crises, with a focus on how to do the analysis in a way that protects people’s privacy.

    Ndubuisi-Obi Jr. also has led a training on CDR analysis for Sierra Leonean government officials and has written a guide on how policymakers can use CDRs safely and effectively. “Some of these data sets will help us answer really important policy questions, and we have to balance that with the privacy risks,” he says. More

  • in

    “To make even the smallest contribution to improving my country would be my dream”

    Thailand has become an economic leader in Southeast Asia in recent decades, but while the country has rapidly industrialized, many Thai citizens have been left behind. As a child growing up in Bangkok, Pavarin Bhandtivej would watch the news and wonder why families in the nearby countryside had next to nothing. He aspired to become a policy researcher and create beneficial change.

    But Bhandtivej knew his goal wouldn’t be easy. He was born with a visual impairment, making it challenging for him to see, read, and navigate. This meant he had to work twice as hard in school to succeed. It took achieving the highest grades for Bhandtivej to break through stigmas and have his talents recognized. Still, he persevered, with a determination to uplift others. “I would return to that initial motivation I had as a kid. For me, to make even the smallest contribution to improving my country would be my dream,” he says.

    “When I would face these obstacles, I would tell myself that struggling people are waiting for someone to design policies for them to have better lives. And that person could be me. I cannot fall here in front of these obstacles. I must stay motivated and move on.”

    Bhandtivej completed his undergraduate degree in economics at Thailand’s top college, Chulalongkorn University. His classes introduced him to many debates about development policy, such as universal basic income. During one debate, after both sides made compelling arguments about how to alleviate poverty, Bhandtivej realized there was no clear winner. “A question came to my mind: Who’s right?” he says. “In terms of theory, both sides were correct. But how could we know what approach would work in the real world?”

    A new approach to higher education

    The search for those answers would lead Bhandtivej to become interested in data analysis. He began investigating online courses, eventually finding the MIT MicroMasters Program in Data, Economics, and Development Policy (DEDP), which was created by MIT’s Department of Economics and the Abdul Latif Jameel Poverty Action Lab (J-PAL). The program requires learners to complete five online courses that teach quantitative methods for evaluating social programs, leading to a MicroMasters credential. Students that pass the courses’ proctored exams are then also eligible to apply for a full-time, accelerated, on-campus master’s program at MIT, led by professors Esther Duflo, Abhijit Banerjee, and Benjamin Olken.

    The program’s mission to make higher education more accessible worked well for Bhandtivej. He studied tirelessly, listening and relistening to online lectures and pausing to scrutinize equations. By the end, his efforts paid off — Bhandtivej was the MicroMasters program’s top scorer. He was soon admitted into the second cohort of the highly selective DEDP master’s program.

    “You can imagine how time-consuming it was to use text-to-speech to get through a 30-page reading with numerous equations, tables, and graphs,” he explains. “Luckily, Disability and Access Services provided accommodations to timed exams and I was able to push through.”   

    In the gap year before the master’s program began, Bhandtivej returned to Chulalongkorn University as a research assistant with Professor Thanyaporn Chankrajang. He began applying his newfound quantitative skills to study the impacts of climate change in Thailand. His contributions helped uncover how rising temperatures and irregular rainfall are leading to reduced rice crop yields. “Thailand is the world’s second largest exporter of rice, and the vast majority of Thais rely heavily on rice for its nutritional and commercial value. We need more data to encourage leaders to act now,” says Bhandtivej. “As a Buddhist, it was meaningful to be part of generating this evidence, as I am always concerned about my impact on other humans and sentient beings.”

    Staying true to his mission

    Now pursuing his master’s on campus, Bhandtivej is taking courses like 14.320 (Econometric Data Science) and studying how to design, conduct, and analyze empirical studies. “The professors I’ve had have opened a whole new world for me,” says Bhandtivej. “They’ve inspired me to see how we can take rigorous scientific practices and apply them to make informed policy decisions. We can do more than rely on theories.”

    The final portion of the program requires a summer capstone experience, which Bhandtivej is using to work at Innovations for Poverty Action. He has recently begun to analyze how remote learning interventions in Bangladesh have performed since Covid-19. Many teachers are concerned, since disruptions in childhood education can lead to intergenerational poverty. “We have tried interventions that connect students with teachers, provide discounted data packages, and send information on where to access adaptive learning technologies and other remote learning resources,” he says. “It will be interesting to see the results. This is a truly urgent topic, as I don’t believe Covid-19 will be the last pandemic of our lifetime.”

    Enhancing education has always been one of Bhandtivej’s priority interests. He sees education as the gateway that brings a person’s innate talent to light. “There is a misconception in many developing countries that disabled people cannot learn, which is untrue,” says Bhandtivej. “Education provides a critical signal to future employers and overall society that we can work and perform just as well, as long as we have appropriate accommodations.”

    In the future, Bhandtivej plans on returning to Thailand to continue his journey as a policy researcher. While he has many issues he would like to tackle, his true purpose still lies in doing work that makes a positive impact on people’s lives. “My hope is that my story encourages people to think of not only what they are capable of achieving themselves, but also what they can do for others.”

    “You may think you are just a small creature on a large planet. That you have just a tiny role to play. But I think — even if we are just a small part — whatever we can do to make life better for our communities, for our country, for our planet … it’s worth it.” More

  • in

    Lincoln Laboratory convenes top network scientists for Graph Exploitation Symposium

    As the Covid-19 pandemic has shown, we live in a richly connected world, facilitating not only the efficient spread of a virus but also of information and influence. What can we learn by analyzing these connections? This is a core question of network science, a field of research that models interactions across physical, biological, social, and information systems to solve problems.

    The 2021 Graph Exploitation Symposium (GraphEx), hosted by MIT Lincoln Laboratory, brought together top network science researchers to share the latest advances and applications in the field.

    “We explore and identify how exploitation of graph data can offer key technology enablers to solve the most pressing problems our nation faces today,” says Edward Kao, a symposium organizer and technical staff in Lincoln Laboratory’s AI Software Architectures and Algorithms Group.

    The themes of the virtual event revolved around some of the year’s most relevant issues, such as analyzing disinformation on social media, modeling the pandemic’s spread, and using graph-based machine learning models to speed drug design.

    “The special sessions on influence operations and Covid-19 at GraphEx reflect the relevance of network and graph-based analysis for understanding the phenomenology of these complicated and impactful aspects of modern-day life, and also may suggest paths forward as we learn more and more about graph manipulation,” says William Streilein, who co-chaired the event with Rajmonda Caceres, both of Lincoln Laboratory.

    Social networks

    Several presentations at the symposium focused on the role of network science in analyzing influence operations (IO), or organized attempts by state and/or non-state actors to spread disinformation narratives.  

    Lincoln Laboratory researchers have been developing tools to classify and quantify the influence of social media accounts that are likely IO accounts, such as those willfully spreading false Covid-19 treatments to vulnerable populations.

    “A cluster of IO accounts acts as an echo chamber to amplify the narrative. The vulnerable population is then engaging in these narratives,” says Erika Mackin, a researcher developing the tool, called RIO or Reconnaissance of Influence Operations.

    To classify IO accounts, Mackin and her team trained an algorithm to detect probable IO accounts in Twitter networks based on a specific hashtag or narrative. One example they studied was #MacronLeaks, a disinformation campaign targeting Emmanuel Macron during the 2017 French presidential election. The algorithm is trained to label accounts within this network as being IO on the basis of several factors, such as the number of interactions with foreign news accounts, the number of links tweeted, or number of languages used. Their model then uses a statistical approach to score an account’s level of influence in spreading the narrative within that network.

    The team has found that their classifier outperforms existing detectors of IO accounts, because it can identify both bot accounts and human-operated ones. They’ve also discovered that IO accounts that pushed the 2017 French election disinformation narrative largely overlap with accounts influentially spreading Covid-19 pandemic disinformation today. “This suggests that these accounts will continue to transition to disinformation narratives,” Mackin says.

    Pandemic modeling

    Throughout the Covid-19 pandemic, leaders have been looking to epidemiological models, which predict how disease will spread, to make sound decisions. Alessandro Vespignani, director of the Network Science Institute at Northeastern University, has been leading Covid-19 modeling efforts in the United States, and shared a keynote on this work at the symposium.

    Besides taking into account the biological facts of the disease, such as its incubation period, Vespignani’s model is especially powerful in its inclusion of community behavior. To run realistic simulations of disease spread, he develops “synthetic populations” that are built by using publicly available, highly detailed datasets about U.S. households. “We create a population that is not real, but is statistically real, and generate a map of the interactions of those individuals,” he says. This information feeds back into the model to predict the spread of the disease. 

    Today, Vespignani is considering how to integrate genomic analysis of the virus into this kind of population modeling in order to understand how variants are spreading. “It’s still a work in progress that is extremely interesting,” he says, adding that this approach has been useful in modeling the dispersal of the Delta variant of SARS-CoV-2. 

    As researchers model the virus’ spread, Lucas Laird at Lincoln Laboratory is considering how network science can be used to design effective control strategies. He and his team are developing a model for customizing strategies for different geographic regions. The effort was spurred by the differences in Covid-19 spread across U.S. communities, and what the researchers found to be a gap in intervention modeling to address those differences.

    As examples, they applied their planning algorithm to three counties in Florida, Massachusetts, and California. Taking into account the characteristics of a specific geographic center, such as the number of susceptible individuals and number of infections there, their planner institutes different strategies in those communities throughout the outbreak duration.

    “Our approach eradicates disease in 100 days, but it also is able to do it with much more targeted interventions than any of the global interventions. In other words, you don’t have to shut down a full country.” Laird adds that their planner offers a “sandbox environment” for exploring intervention strategies in the future.

    Machine learning with graphs

    Graph-based machine learning is receiving increasing attention for its potential to “learn” the complex relationships between graphical data, and thus extract new insights or predictions about these relationships. This interest has given rise to a new class of algorithms called graph neural networks. Today, graph neural networks are being applied in areas such as drug discovery and material design, with promising results.

    “We can now apply deep learning much more broadly, not only to medical images and biological sequences. This creates new opportunities in data-rich biology and medicine,” says Marinka Zitnik, an assistant professor at Harvard University who presented her research at GraphEx.

    Zitnik’s research focuses on the rich networks of interactions between proteins, drugs, disease, and patients, at the scale of billions of interactions. One application of this research is discovering drugs to treat diseases with no or few approved drug treatments, such as for Covid-19. In April, Zitnik’s team published a paper on their research that used graph neural networks to rank 6,340 drugs for their expected efficacy against SARS-CoV-2, identifying four that could be repurposed to treat Covid-19.

    At Lincoln Laboratory, researchers are similarly applying graph neural networks to the challenge of designing advanced materials, such as those that can withstand extreme radiation or capture carbon dioxide. Like the process of designing drugs, the trial-and-error approach to materials design is time-consuming and costly. The laboratory’s team is developing graph neural networks that can learn relationships between a material’s crystalline structure and its properties. This network can then be used to predict a variety of properties from any new crystal structure, greatly speeding up the process of screening materials with desired properties for specific applications.

    “Graph representation learning has emerged as a rich and thriving research area for incorporating inductive bias and structured priors during the machine learning process, with broad applications such as drug design, accelerated scientific discovery, and personalized recommendation systems,” Caceres says. 

    A vibrant community

    Lincoln Laboratory has hosted the GraphEx Symposium annually since 2010, with the exception of last year’s cancellation due to Covid-19. “One key takeaway is that despite the postponement from last year and the need to be virtual, the GraphEx community is as vibrant and active as it’s ever been,” Streilein says. “Network-based analysis continues to expand its reach and is applied to ever-more important areas of science, society, and defense with increasing impact.”

    In addition to those from Lincoln Laboratory, technical committee members and co-chairs of the GraphEx Symposium included researchers from Harvard University, Arizona State University, Stanford University, Smith College, Duke University, the U.S. Department of Defense, and Sandia National Laboratories. More