Recently, large-scale language data combined with modern machine learning techniques have shown strong value as means for studying human psychology and behavior. For example, language alone has been shown predictive in mental health, personality, and health behaviors. However, many applications for such language-based assessments have readily available and important data beyond language (i.e. extra-linguistics), such as predicting the subjective well-being of a community using tweets, where one can take into account their age, education, and demographic attributes. Language may capture some characteristics while extra-linguistic variables captures others. We believe that effectively integrating linguistic and extra-linguistic data can yield benefits beyond either independently.

In this thesis, we develop methods which effectively integrate extra-linguistic data with language data focused primarily on social scientific applications. The central challenge is dealing with the size and heterogeneity of, often sparse and noisy, language data versus the, often low-dimensional and non-sparse, extra-linguistic variables. First, we consider structured extra-linguistics, like socioeconomic (income and education rates) and demographics (age, gender, etc.), and propose two integration methods, named residualized controls (RC) and residualized factor adaptation (RFA), to be used in county-wise prediction tasks. Demonstrating techniques that integrate information at both the model-level and data-level, we found consistently strong improvement over naively combining features, for example, increasing county level well-being predictions by over 12%. Next, we consider unstructured extra-linguistic data. In the first part, we incorporate social network connections and language over time to propose a novel metric for quantifying the stickiness of words - their ability to spread across friendship connections in a social network over time (or in other words, stick in ones vocabulary after seeing friends use it). We obtain which language features are more probable to disseminate through friendship and show such a metric is useful for predicting who will be friends and what content will spread. In addition, we analyze language content over time by proposing a novel dynamic content-specific topic modeling technique that can help to identify different sub-domains of a thematic scope and can be used to track societal shifts in concerns or views over time.


George Em Karniadakis received his SM and PhD from Massachusetts Institute of Technology. He was appointed lecturer in the Department of Mechanical Engineering at MIT in 1987 and subsequently he joined the Center for Turbulence Research at Stanford/Nasa Ames. He joined Princeton University as assistant professor in the Department of Mechanical and Aerospace Engineering and as associate faculty in the program of applied and computational mathematics. He was a visiting professor at Caltech in 1993 in the Aeronautics Department and joined Brown University as associate professor of applied mathematics in the Center for Fluid Mechanics in 1994. After becoming a full professor in 1996, he continues to be a visiting professor and senior lecturer of Ocean/Mechanical Engineering at MIT. He is an AAAS fellow (2018), fellow of the Society for Industrial and Applied Mathematics (2010), fellow of the American Physical Society (2004), fellow of the American Society of Mechanical Engineers (2003) and associate fellow of the American Institute of Aeronautics and Astronautics (2006). He received the Alexander von Humboldt award in 2017, the Ralf E Kleinman award (2015), the J. Tinsley Oden Medal (2013), and the CFD award (2007) from the US Association in Computational Mechanics. His h-index is 103, and he has been cited over 52,000 times.


Abstract:
Karniadakis will present a new approach to develop a data-driven, learning-based framework for predicting outcomes of physical and biological systems, governed by PDEs, and for discovering hidden physics from noisy data. He will introduce a deep learning approach based on neural networks (NNs) and generative adversarial networks (GANs). He will also introduce new NNs that learn functionals and nonlinear operators from functions and corresponding responses for system identification. Unlike other approaches that rely on big data, here we learn from small data by exploiting the information provided by the physical conservation laws, which are used to obtain informative priors or regularize the neural networks. He will demonstrate the power of PINNs for several inverse problems in fluid mechanics, solid mechanics and biomedicine including wake flows, shock tube problems, material characterization, brain aneurysms, etc., where traditional methods fail due to lack of boundary and initial conditions or material properties. He will also present a new NN, DeepM&Mnet, which uses DeepOnets as building blocks for multiphysics problems, and he will demonstrate its unique capability in a 7-field hypersonics application.  

To register and for more information, click here 

https://stonybrook.zoom.us/j/99820812332?pwd=c05BSTVLNmw3L04yZjdEcG5pem…

Speaker: Alexei Koulakov of Cold Spring Harbor Laboratory Brain evolution as a machine learning problem
We have entered a golden age of artificial intelligence research,
driven mainly by the advances in ANNs over the last decade or so.
Applications of these techniques--to machine vision, speech
recognition, autonomous vehicles, machine translation and many other
domains--are coming so quickly that many observers predict that the
long-elusive goal of Artificial General Intelligence (AGI) is within
our grasp. However, we still cannot build a machine capable of
building a nest, stalking prey, or loading a dishwasher. I will
describe several projects, ranging from theories of evolution of
neural development to the perception of smells, in which we are
attempting to understand the algorithms that the nervous system is
using to solve some of these challenging problems.





https://stonybrook.zoom.us/j/91775729097pwd=Qlc5Nks0NmlyKzJwMjR0S0hrdVZ3QT09

Meeting ID: 917 7572 9097
Passcode: 555459


Abstract: As the saying goes, there are many ways to skin a cat.
While we don't want to go around skinning cats, the world of
optimization is rich with different problems, problem formulations,
and methods and approaches, each with different guarantees and
computational benefits. In this talk we will take a tour down the
problem of structured sparsity in sensing to see how one simple
problem can inspire a wide range of analysis and tools. First, I will
present the optimality conditions for a generalized structured sparse
problem, which can be geometrically visualized as alignment of vectors
and matrices. Then I will introduce three approximation methods for
the problem of phase retrieval, which are a twist on stochastic
gradient and coordinate descent methods. These methods leverage
fundamental numerical linear algebra concepts to give fast approximate
solutions to large-scale problems, which then after postprocessing can
produce more reliable sensing results.

Bio: Yifan Sun received her PhD in Electrical Engineering from the
University of California Los Angeles in 2015, with research focusing
on convex optimization and semidefinite programming. She was then
Technicolor Research and Innovation, focusing on machine learning and
data science applications. More recently, she completed two postdocs,
at the University of British Columbia in Vancouver, Canada and
L'Institut National de Recherche en Informatique et Automatique
(INRIA) in Paris, France.


https://stonybrook.zoom.us/j/94414957054?pwd=V1JMc2EwSnVGMFdaUlNobE9DSHU4dz09#success
ID: 94414957054
Password: 094758

Speaker: Heather J. Lynch


Bio:  Dr. Heather J. Lynch is an Associate Professor of Ecology & Evolution at Stony Brook University. Prior to Stony Brook, Dr. Lynch was an Adjunct Professor of Applied Math and Statistics at UC Santa Cruz and a Research Scientist in the Biology Department at the University Maryland. Dr. Lynch received her A.B. in Physics from Princeton University in 2000, an A.M. in Physics from Harvard University in 2004, and a Ph.D. in Organismal and Evolutionary Biology from Harvard University in 2006. Dr. Lynch's research is focused on spatial population dynamics of Antarctic penguins, with a particular focus on statistical and mathematical models to integrate patchy time series with remote sensing imagery. These data will allow Dr. Lynch and colleagues to develop mathematical models to explore how coloniality constrains the colonization and extinction of individual habitat patches and, ultimately, the metapopulation dynamics of colonial seabirds.   

The Center of Excellence in Wireless and Information Technology (CEWIT) will host the 16th International Conference on Emerging Technologies for a Smarter World (CEWIT2020) virtually on November 5, 2020. The conference will center on the four major fields which are penetrating our business and personal lives: Machine Learning, Artificial Intelligence, Blockchain and Computational Medicine. For more info visit: https://www.cewit.org/.  

Join Zoom Meeting
https://stonybrook.zoom.us/j/91945227869?pwd=emhoZDFWVTV0MVdPWW5uVk43MjQzUT09

Meeting ID: 919 4522 7869
Passcode: 452304
One tap mobile
+16468769923,,91945227869# US (New York)
+13126266799,,91945227869# US (Chicago)

Dial by your location
        +1 646 876 9923 US (New York)
        +1 312 626 6799 US (Chicago)
        +1 301 715 8592 US (Germantown)
        +1 669 900 6833 US (San Jose)
        +1 253 215 8782 US (Tacoma)
        +1 346 248 7799 US (Houston)
        +1 408 638 0968 US (San Jose)
Meeting ID: 919 4522 7869
Find your local number: https://stonybrook.zoom.us/u/aCvAYWkRg