I'm a PhD in Pure Mathematics turned Developer turned Data Scientist.
A typical day of my life includes: data exploration, data transformations and removal of noise, experiment design, machine learning model training, validation, testing and orchestration. My day might also include mentoring peers and support other areas where advanced quantitative knowledge might be required.
I run weekly seminars in Machine Learning and Statistical Methods to provide training to those looking to advance (or switch) their careers in Data Science
Supervised, Unsupervised and Reinforcement Learning
Deep Learning, NLP techniques
AutoML Parameter Tuning, ML Orchestration and Deployment
Probability and Statistics, Stochastic Analysis
Functional Analysis, Advanced Calculus and Linear Algebra
OOP, OO Design, Algorithms and Data Structures
SQL, NoQSL, Data Lakes
MSSQL, PostgreSQL, MySQL, Redshift, Snowflake, CouchDB, MongoDB, S3, HDFS
Periscope, Tableau, matlibplot, ggplot
Managed Analysts and Data Scientists. Bridge between Engineers, Analysts and Executives
Hardcore believer in Radical Candor and Strategic Planning
Continuous Integration / Deployment
Over a decade of experience teaching Calculus, Linear Algebra, Differential Equations, Complex Analysis, etc.
Professional mentoring on SQL, Machine Learning techniques and tools
Jun 2017 - Present
→ Leading the Machine Learning research, development and deployment of company’s segmentation engine
→ Built from the ground up a Data Science Ecosystem to create a CI/CD workflow around Machine Learning and Analytics. The ecosystem leverages our Data Warehouse, together with Databricks’ Spark/Python/R clusters and MLFlow for Machine Learning orchestration and deployment pipelines.
→ Mentoring, through regular seminars, for analysts and engineers with an interest to pursue Data Science as a career
→ Managed BI analysts to provide reporting services. Provided SQL mentoring, implemented coding best practices and developed a workflow to serve the different units of the company
→ Developed and deployed retention logic
→ Wrote premium algorithm for California (Python / PostgreSQL) to pair and test policy administration system
Jan 2016 - Jun 2017
Client in the Retirement Financial Advisor sector
→ Designed and developed (in R) back-end engine for an Efficient Frontier portfolio allocation
Client in the Big Data provider sector
→ Admin support for Hortonworks Hadoop Clusters. Yarn Queue Configurations, Hive functional testing
Jan 2014 - Dec 2015
→ Translated Matlab code prototype from the newly created “Simulink Interface View” into C++ code base
→ Led the development and implementation of the new “Simulink Function Blocks Traceability”:
➠ Helped designed (UML) classes for back and front end. Wrote (C++) the functionality of the back-end classes
➠ Enhanced UI behavior of in-house routing algorithms for these newly added function-caller connections
Aug 2012 - Dec 2013
Researched and developed metrics to understand the behavior of the Semantic Networks arising from in-house NLP algorithms
Enhanced NLP based pattern-matching techniques for recommender system
Implementation and deployment of statistical tools for analysis of massive data
Aug 2007 - Dec 2011
Ann Arbor, MI
Obtained the sharpest description of the Green Current, generalizing all of the available results at the time
→ The Jacobian cocycle and equidistribution towards the Green current (arXiv)
→ Lelong numbers on projective varieties (Ann. Fac. Sci. Toulouse Math. (20) n 4, 2011, pp. 781-800)
Aug 2007 - Dec 2011
Ann Arbor, MI
Equidistribution Towards The Green Current.
Apr 2006 - Jul 2007
Complex Geometry and Pluripotential Theory.
Mar 2003 - Dec 2005
La Medida de Equilibrio en Dinamica Compleja.