Résumé for Avi Alkalay

Data Scientist & Data Engineer


  • Data Modelling, Advanced SQL
  • Python, Pandas, Data Wrangling
  • Linux, Open Source, Scripting
  • Cloud, SaaS, APIs, Containers
  • DevOps, Agile
  • Data Visualization, Matplotlib
  • Security and GDPR
  • Statistical Models
  • Machine Learning, Predictive Analytics
  • SciKit, SciPy, Statsmodels, Seaborn


  • English
  • Portuguese
  • Spanish

On the web

Nationalities & Work Permits

Europe (Portugal), Brazil, Israel


Jazz and classical music, Brazilian folklore, modern dance, politics and philosophy


Data professional with a strong technical background combined with a long track record of business-facing roles. Proven experience in planning and delivering top-to-bottom solutions across global enterprises and successful startups, in the financial services, retail and health sectors, among others.

I’ve architected software to address complex business needs and have written, packaged and containerised production-grade code using multiple technologies. I work well in multidisciplinary teams (data analysts, scientists and engineers, developers, architects and sysadmins) and am committed to deadlines, best practices and methodology.

I’m an active writer, having published several books and many articles ranging from the strategic adoption of cutting-edge technology to expert technical knowledge sharing (LinkedIn, personal blog, IBM Thoughts on Cloud).

I’m also a public speaker and innovation evangelist.

Professional Experience

Data Scientist and Engineer at CI&T

June 2019 — Present

CI&T is a large IT services company and a leader in Agile methodologies. My current client is Bradesco Seguros. As the senior data engineer on the team, my role is to define and implement standards and best practices for Data Streams, Data Lakes and Data Warehouses. One of my missions is to guarantee that the end-user portal automatically gets its daily feed of business data from backend transactional systems; another is to feed analytical data to the line-of-business BI tools. To achieve that, I’ve created complex Python and PowerCenter ETLs that reliably and continuously sync large amounts of data from Google Analytics, Typeform and transactional mainframe databases to data warehouses and data lakes.

Data Science Teacher at Digital House

April 2019 — Present

I lead data science classes from start to end. I am responsible for lessons, content and student coordination throughout the complete journey of the course, aiming at the integration and absorption of all subjects. I deliver complete lessons on BI, advanced SQL, Pandas, SciKit Learn, TensorFlow, Statistical Modelling, Regressions, Time Series, Predictive Analytics, Artificial Intelligence, Full Stack, Innovation, Mobility and Data Strategy in general.

Executive Innovation Architect at IBM

June 1997 — April 2018

Over more than 20 years, I held many customer-facing and project delivery roles. My last position was as an executive architect and thought leader for cutting-edge tech, such as predictive analytics, blockchain, cloud, mobility, IoT, supercomputing, integration and artificial intelligence. I have worked extensively with large global accounts such as Hyundai, Avon, Telefonica, Toyota, Ambev, Petrobras, Embraer, Vale, Itaú, Bradesco, Sulamérica and Via Varejo, among others.


Data Science, Digital House, São Paulo, 2018

Computer Science, UNESP, 1991 to 1995