About me
“My philosophy is that not only are you responsible for your life but doing the best at this moment puts you in the best place for the next moment.” - Oprah Winfrey
I am passionate about the digital world. I like to learn more every day about different areas of technology, both from the people around me and through study. I give my best in every project. I value respect and humility. My lifelong goal is to help people with the knowledge I acquire.
I am from
Skills
Hard skills
Courses
> Tableau Essentials • LinkedIn
> Getting started with Glue AWS • AWS Skill Builder
> BigQuery for Data Analysis I • Google Cloud Skill Boost
> Product Growth • PM3
> Product Management • PM3
> Customer Experience • Escola Conquer
> Growth Hacking 3.0 • Escola Conquer
> C1, B2, B1, A2, A1 English levels • ABA English
> Statistics and Python • Alura
> Business Intelligence • FIAP and Alura
> Design thinking • Alura
> Git and GitHub • Alura
> Office package (Excel, Word, PowerPoint, Access)
> Administrative assistant
> Human resources
and others_
Programming languages and databases
My specialties are_
> SQL (BigQuery, Synapse, PostgreSQL, Presto, Trino, SparkSQL, SQLite)
> Python (pandas, matplotlib, seaborn, scikit-learn, etc.)
> Scala
> Apache Spark
I've worked with_
> Apache Spark, Scala, Python, SQL
> HTML, CSS, JavaScript, React, Java
> Spring MVC, Spring Boot
I already studied_
> R
> Android and Swift (iOS)
> Visual Basic
> Borland Delphi 7
> Java
> C
Tools and theories
I've already used_
> Versioning tools: Git, GitHub, GitLab, SVN
> IDEs: Jupyter Notebook, Anaconda, RStudio, VS Code, NetBeans, Eclipse
> OS: Windows, Linux (Ubuntu, Debian, Lubuntu), Android
> Hardware: Raspberry Pi, FPGA, Arduino
> Data viz tools: Tableau, Looker, Data Studio, Redash, D3.js, Excel, PowerPoint, Google Sheets
> Project management: Jira, Confluence, Trello, Mantis, Miro, Teams, Google Meet, OneTrust
> Clouds: Google, Azure, AWS
> Theories: LGPD, Zen of Python, Customer Experience, Design Thinking, fundamentals of software architecture, Scrum
Languages
> Portuguese • Native
> English • Advanced
> Spanish • Intermediate
Certifications
> BigQuery for Data Analysis • Google • 2023
> Azure AI • Azure • 2022
> Product Owner • Scrum Inc • 2021 • Credential ID#PO-7620962
> OneTrust Certified Privacy Professional • OneTrust • Credential C24917
Soft skills
> Responsible
> Humble
> Collaborative
> Sense of ownership
> Open-minded
> Data-driven mindset
> Analytical ability
> Organized
> Creative
> Communicative
> Multitasking
> Self-taught
My badges
Experiences
Career timeline
2021 Oct • 2023 Mar
DATA ANALYST
Incognia Tecnologia da Informação LTDA
Activities
> Data analysis in a Big Data and B2B context to identify fraud (chargeback, MPOS, account takeover, among others) and reduce friction in mobile applications during the login, onboarding, and payment processes, based on behavior and location data.
> Building dashboards with storytelling for clients, CS, and the marketing team.
> Handling terabytes of data with parallel, distributed, and in-memory computing.
> ETL (extract, transform, and load) processing.
> SDK and API data manipulation.
Technologies
> SQL, Apache Spark, Scala, Python, Jupyter Notebook and JupyterHub, AWS S3, GitHub, Trino, Redash, Presto, ScyllaDB
2020 Sep • 2021 Oct
PRODUCT OWNER AND BUSINESS ANALYST
GlobalHitss (outsourced to Claro S.A.)
Goals
> Product Owner of the LGPD Squad for the development of privacy portals
> Consultant for the Digital area to comply with the LGPD.
> Interface between legal, technical, and marketing teams.
Activities
> Creation and maintenance of the Product Backlog
> Story refinement
> Participation in Scrum events
> Monitoring OKRs and KPIs
> Support for Digital squads in collecting consent and adding cookie banners.
Technologies
> Jira, Confluence, OneTrust Platform, Google Tag Manager, Google Data Studio, CoBlue, Pipefy, Teams, Excel, PowerPoint.
2020 Feb • 2020 Sep
TEACHER
ETEC - Escola Técnica Estadual de São Paulo
Activities
> Teacher of the Remote Video Editing course - Novotec 2021
Subjects taught
> Content management for social media, video editing (computerized applications), and multimedia production.
2020 Feb • 2020 Sep
SOFTWARE ENGINEER
Activities
> Frontend developer of Claro's Privacy Portal web system (https://claro.com.br/privacidade), with a clean code mentality
> Code review
> Tests (unit, integration, functional)
> Deployment tracking
> Following the organization proposed by the Scrum framework
Technologies
> Scrum, Clean Code, React, JavaScript, HTML, CSS, Mondrian (Claro's Framework), OneTrust, Jenkins, GitLab, Jira, and Confluence
2018 Apr • 2019 Dec
FULLSTACK DEVELOPER AND SCIENTIFIC RESEARCHER
The Database Group (GBD) is a non-profit organization managed by Doctor Carlos Roberto Valêncio, a professor at the IBILCE campus of UNESP (São Paulo State University).
The group works on two fronts: development and scientific research.
Development activities
> AvaliaIbilce, the discipline evaluation system for IBILCE: fullstack development of the system that allows students to evaluate professors, infrastructure, and disciplines taught on campus, and that provides statistics and visualizations of the results to professors, coordinators, and directors.
> ETL and data analysis of AVALIA systems
Technologies
> Frontend: HTML, CSS, JavaScript, Bootstrap, D3.js
> Backend: Java, PostgreSQL, pgAdmin, Spring MVC, Apache Tomcat
See more:
https://www.grupogbd.com/PortalGBD/projeto_info?idProjeto=25
https://institucional.grupogbd.com/avaliaibilce/login
Research Activities
> Data cleaning and application of machine learning algorithms to discover knowledge and aid decision-making in the GEP (Medical Records Management and Evolution System), which serves Hospital Dr. Adolfo Bezerra de Menezes (HABM).
https://www.grupogbd.com/PortalGBD/projeto_info?idProjeto=6
> Development of an environment for text pre-processing and deduplication in Big Data context databases
Research title: Environment for identifying duplicate tuples with Apache Spark: efficient pre-processing for data cleaning. PROPE/REITORIA Scholarship - Request: 48491.
Technologies
> Natural language processing and machine learning algorithms, SQL, Apache Spark, Java, Google Guava Library, Corpus, and Thesaurus.
2018 Apr • Start
Projects
Personal projects
> XXXI Unesp Scientific Initiation Congress
UNESP - Universidade Estadual Paulista "Júlio de Mesquita Filho"
> Organizing Committee of the XXVIII SEMAC - UNESP Computing Week in São José do Rio Preto
UNESP - Universidade Estadual Paulista "Júlio de Mesquita Filho"
> GitHub projects
> University Restaurant Management Support System - SRU/IBILCE
> Discovery of knowledge for Hospital Dr. Adolfo Bezerra de Menezes (HABM)
Data cleaning in the Big Data scenario and application of Machine Learning algorithms for knowledge discovery and decision making for the GEP (Medical Records Management System) for Hospital Dr. Adolfo Bezerra de Menezes (HABM).
> Development of the AvaliaIbilce System and maintenance of the AVALIA system
Fullstack development of a web system for Unesp to evaluate professors, disciplines, and resources. The system includes login, personal data editing, an evaluation area, a reports area, and results visualization, among other features. Beforehand, I modeled the relational database with the entities Student, Course, Teachers, and Disciplines and created the database. Every semester, the data had to be loaded to start the evaluation cycle.
Research
[PT-BR]
> Lorraine Maria Pepe - Ambiente para identificação de tuplas duplicadas com Apache Spark: pré-processamento eficiente para limpeza de dados. (Bolsista PROPE/REITORIA - Pedido: 48491). Período: 08/2018 a 07/2019. 2018 - Universidade Estadual Paulista Júlio de Mesquita Filho, Pró-Reitoria de Pesquisa. Orientador: Carlos Roberto Valêncio.
> PEPE, L. M.; VALÊNCIO, C. R. Ambiente para a identificação de tuplas duplicadas com recursos de paralelização: pré-processamento eficiente para limpeza de dados. XXXI Congresso de Iniciação Científica da Unesp, 2019.
Workshops
Some events and workshops:
> Zero to analytics — EDW modernization workshop • Google Cloud
2023, April 04th - Duration: 2h
> BigQuery for data analysis • Google Cloud
2023, March 21st - Duration: 4h
> Big data with Apache Spark • Unesp
2018, October - Duration: 4h